Feature #1596

TBX: X.X, new XML-XTZ + CSV import option : word element

Added by Matthieu Decorde almost 4 years ago. Updated almost 4 years ago.

Status: New
Start date: 11/27/2015
Priority: Normal
Due date:
Assignee: -
% Done: 80%
Category: Import
Spent time: -
Target version: TXM - Oriflamms 1.0

Description

Currently, <w> is the default, implicit element name used to encode word-level units (the leaves of the TXM corpus model) in the XML sources of all XML-based import modules. Some corpora need the word level to be associated with smaller linguistic units encoded by other XML elements:
  • letters (for example, the TEI <c> element, which encodes characters)
  • syllables
  • etc.

Solution

Make the word level XML element name public and customizable.

Add a new import parameter that declares the XML element that encodes the CQP words/tokens.

"w" is the default value for that parameter.

This new parameter affects all import module steps.

Note: for now, this parameter is only available in the XML-XTZ import module.
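
The idea of a configurable word element can be illustrated with a minimal Python sketch. This is not TXM code and does not reflect how the XML-XTZ module is implemented: the function name extract_tokens, the parameter name word_element, and the sample document are all hypothetical, chosen only to show a token extractor whose element name defaults to "w" and can be overridden (here with "c").

```python
import xml.etree.ElementTree as ET


def extract_tokens(xml_text, word_element="w"):
    """Return the text of every element named `word_element`, in document order.

    `word_element` is a hypothetical stand-in for the import parameter
    proposed in this ticket; "w" mirrors its stated default value.
    """
    root = ET.fromstring(xml_text)
    tokens = []
    for elem in root.iter():
        # Compare local names so that namespaced sources (e.g. TEI) also match.
        local_name = elem.tag.rsplit("}", 1)[-1]
        if local_name == word_element:
            tokens.append("".join(elem.itertext()))
    return tokens


# Hypothetical sample: words encoded with <w>, letters with <c>.
SAMPLE = "<text><w><c>l</c><c>e</c></w> <w><c>r</c><c>o</c><c>i</c></w></text>"

print(extract_tokens(SAMPLE))                    # ['le', 'roi']
print(extract_tokens(SAMPLE, word_element="c"))  # ['l', 'e', 'r', 'o', 'i']
```

With the default value the tokens are the words; passing "c" makes each letter a token, which is the kind of corpus the description above has in mind.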

History

#1 Updated by Serge Heiden almost 4 years ago

  • Description updated (diff)

#2 Updated by Matthieu Decorde almost 4 years ago

  • Description updated (diff)

#3 Updated by Serge Heiden almost 4 years ago

  • Description updated (diff)

#4 Updated by Serge Heiden almost 4 years ago

  • Description updated (diff)

#5 Updated by Matthieu Decorde almost 4 years ago

  • % Done changed from 0 to 80
