Feature #1596

Mis à jour par Serge Heiden il y a presque 10 ans

Currently, <w> is Usually the default implicit element used to encode word level units (leafs) of "w" tags encodes the TXM words. Some corpus model in all the XML based import modules. Some corpora need needs to be able to associate to the word level smaller linguistic units:
* letters (characters)
* syllables
* etc.
see characters (or letters) as words.

h3. Solution

Make Add the word level XML element name public and costumizable.

Add a
new import parameter that declares defines the XML element that encodes the CQP words/tokens.

If not specified, "w" is the default value for that parameter. word element.

All import module steps are concerned by with this new parameter.

Retour