Feature #1596

Updated by Serge Heiden almost 10 years ago

Currently, <w> is the default, implicit element name used to encode word-level units (leaves) of the TXM corpus model in XML sources, for all XML-based import modules. Some corpora need to associate smaller linguistic units with each word, encoded by other XML elements:
* letters (for example, the TEI <c> element, which encodes characters)
* syllables
* etc.

h3. Solution

Make the word-level XML element name public and customizable.

Add a new import parameter that declares the XML element that encodes the CQP words/tokens.

"w" is the default value for that parameter.

This new parameter affects all import module steps.

Note: for now, this is only available in the XML-XTZ import module.
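
To illustrate the idea of a configurable word element name, here is a minimal Python sketch. It is not TXM code: the function name @extract_words@, the parameter name @word_tag@, and the sample documents are hypothetical, and XML namespaces (which real TEI sources usually declare) are ignored for brevity. It only shows how a parameter defaulting to "w" could select the elements treated as words, while sub-word elements such as <c> stay nested inside them.

```python
import xml.etree.ElementTree as ET


def extract_words(xml_text, word_tag="w"):
    """Return the text of every element named `word_tag`, in document order.

    `word_tag` plays the role of the proposed import parameter
    (hypothetical name); "w" is its default value.
    """
    root = ET.fromstring(xml_text)
    # itertext() concatenates text from nested sub-word units such as <c>,
    # so a word split into characters is still returned as one token.
    return ["".join(el.itertext()).strip() for el in root.iter(word_tag)]


# Hypothetical sample sources: one uses the default <w>, the other <tok>.
SAMPLE_W = "<text><s><w>Hello</w> <w><c>T</c><c>X</c><c>M</c></w></s></text>"
SAMPLE_TOK = "<text><s><tok>Hello</tok> <tok>TXM</tok></s></text>"

if __name__ == "__main__":
    print(extract_words(SAMPLE_W))                   # default element name
    print(extract_words(SAMPLE_TOK, word_tag="tok")) # custom element name
```

With the default value, a source that uses a different element name yields no words at all, which is why the parameter has to reach every import step that looks up word elements.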
