Feature #1596
TBX: X.X, new XML-XTZ + CSV import option : word element
Status: | New | Start date: | 11/27/2015 | ||
---|---|---|---|---|---|
Priority: | Normal | Due date: | |||
Assignee: | - | % Done: | 80% |
||
Category: | Import | Spent time: | - | ||
Target version: | TXM - Oriflamms 1.0 |
Description
Currently, <w> is the default implicit element name used to encode word level units (leafs) of the TXM corpus model in XML sources for all the XML based import modules. Some corpora need to be able to associate to the word level smaller linguistic units encoded by other XML elements:
- letters (for example by the <c> element to encode TEI characters)
- syllables
- etc.
Solution¶
Make the word level XML element name public and customizable.
Add a new import parameter that declares the XML element that encodes the CQP words/tokens.
"w" is the default value for that parameter.
All import module steps are concerned by this new parameter.
note: only available in XML-XTZ import module for now
History
#1 Updated by Serge Heiden over 7 years ago
- Description updated (diff)
#2 Updated by Matthieu Decorde over 7 years ago
- Description updated (diff)
#3 Updated by Serge Heiden over 7 years ago
- Description updated (diff)
#4 Updated by Serge Heiden over 7 years ago
- Description updated (diff)
#5 Updated by Matthieu Decorde over 7 years ago
- % Done changed from 0 to 80