Feature #2050

Mis à jour par Serge Heiden il y a plus de 8 ans

See https://groupes.renater.fr/wiki/txm-info/public/import_xtz#etape_txm_079. This ticket groups various features that need improvement in the XTZ module.

# Use $pagination-element and not pb when building the facs edition, see #2146.
# Simplify producing multi-facetted editions (from XML-TXM format)
#* it is currently very complicated to keep tags (e.g page breaks) inside non default edition facet
# When 'Tokenize' option is off, check that all word-elements have @id
# Handle XML-TXM as input format
#* currently XML-TXM can be imported via XTZ module (word properties and default editions are correct) but injection of morpho-syntactic annotation (when annotating with TreeTagger) is broken due to nesting txm:form elements (from the source and from the annotation process)
#* TXM should detect txm:form and txm:ana child nodes of the word element and transfer them correctly to XML-TXM
# Handle nesting word-level elements
#* currently, if you have num/w in the source file, XTZ nests w elements in the XML-TXM file (num/w -> w/w)
#* if nesting word-level elements are detected, only the lowest level should be considered as token by TXM
# implement alternative ways of defining text order in the corpus
## text/@textorder
## textorder XPath
## textorder column in metadata.csv (currently implemented)
# implement defining metadata through XPath (as in current TEI-BFM module)
# add @n to milestone element indexation (cf. #1597, #2162)
** start n numbering at 1 for ALL automatic @n

Retour