Feature #1508

TBX: 0.7.7, transfer some HTML equivalent elements from source to HTML edition pages in XML import module

Added by Serge Heiden about 8 years ago. Updated almost 8 years ago.

Status: New
Start date: 09/19/2015
Priority: Normal
Due date:
Assignee: -
% Done: 80%
Category: Edition
Spent time: -
Target version: TXM 0.7.8

Description

Currently, the pager of the XML/w+CSV import module does not transfer any elements with HTML semantics to the HTML edition pages, even though they are present in the WTC source; that is, those structures are available for CQL queries.

It may be useful to make some source elements that have HTML equivalents available in corpus editions. For example, web links (<a> elements) could be interpreted by the current edition HTML renderer. Other elements could be considered: <p>, <i>... (most HTML elements)

Discussion

The default XML/w+CSV import module should not interpret any semantics of XML elements. However, it already interprets the semantics of TEI elements for various purposes. As with the TEI-semantics-aware XTZ importer that we have to develop, we may add a specific semantic mapping from some TEI elements found in the sources to some (X)HTML elements rendered in the HTML editions.
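
To make the idea of a semantic mapping concrete, here is a minimal sketch in Python (not TXM code). The mapping table, the function name to_html, and the fallback to a <span> carrying the source element name are all assumptions made for illustration. It also assumes un-namespaced element names and treats every <hi> as italic, which is a simplification.

```python
import xml.etree.ElementTree as ET

# Hypothetical mapping from TEI element names to (X)HTML element names.
# Simplification: every <hi> is rendered as <i>, whatever its rendition.
TEI_TO_HTML = {"p": "p", "hi": "i"}


def to_html(elem, mapping=TEI_TO_HTML):
    """Return a copy of elem with mapped elements renamed.

    Unmapped elements become <span class="source-name"> (an assumption,
    chosen so that no source structure is silently dropped).
    Source attributes are not copied in this sketch.
    """
    tag = mapping.get(elem.tag)
    if tag is None:
        out = ET.Element("span", {"class": elem.tag})
    else:
        out = ET.Element(tag)
    # Preserve the text before the first child and after the element.
    out.text = elem.text
    out.tail = elem.tail
    for child in elem:
        out.append(to_html(child, mapping))
    return out


source = ET.fromstring(
    "<p>Some <hi>emphasized</hi> text in a <seg>segment</seg>.</p>"
)
html = ET.tostring(to_html(source), encoding="unicode")
print(html)
```

Keeping the mapping in a plain table would let an import module extend it element by element without changing the traversal.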

Solution for links

Produce an '<a href="url" target="output">url</a>' element in the HTML pages when the source contains '<ref target="url">url</ref>'. By default, the 'output' frame could receive all followed links.

MD: OK, but the content of the <a> elements is tokenized.
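
The link conversion described above could look roughly like the following Python sketch (not TXM code). The function name ref_to_anchor, the example URL, and the shape of the tokenized content (the label split into <w> children) are assumptions. Joining the tokens without separators suits a URL label but would lose the spaces of a multi-word label.

```python
import xml.etree.ElementTree as ET


def ref_to_anchor(ref, frame="output"):
    """Build an HTML <a> element from a TEI-like <ref target="..."> element."""
    url = ref.get("target", "")
    anchor = ET.Element("a", {"href": url, "target": frame})
    # The content may have been tokenized into <w> children (assumption):
    # rebuild the label from all text nodes, in document order.
    label = "".join(ref.itertext()).strip()
    # Fall back to the URL itself when the element has no text content.
    anchor.text = label or url
    return anchor


# Hypothetical tokenized source: the label is split into three <w> tokens.
ref = ET.fromstring(
    '<ref target="http://example.org"><w>example</w><w>.</w><w>org</w></ref>'
)
anchor_html = ET.tostring(ref_to_anchor(ref), encoding="unicode")
print(anchor_html)
```

An alternative under the same assumptions is to ignore the tokenized content and always use the target attribute as the link text, which matches the case in the description where the label is the URL itself.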

History

#1 Updated by Alexey Lavrentev about 8 years ago

Shouldn't we try to integrate the XTZImporter macro (#1288), which provides a more general solution for preserving XML structures in the HTML edition pages?

#2 Updated by Matthieu Decorde almost 8 years ago

  • Description updated (diff)

#3 Updated by Matthieu Decorde almost 8 years ago

  • % Done changed from 0 to 80

#4 Updated by Serge Heiden almost 8 years ago

  • Subject changed from TBX: 0.7.7, transfer some HTML elements from source to HTML edition pages in XML import module to TBX: 0.7.7, transfer some HTML equivalent elements from source to HTML edition pages in XML import module
  • Description updated (diff)
