Feature #1031
RCP: Add structures start positions to the R object produced by the Corpus SendToR command
Statut: | New | Début: | 03/10/2014 | |
---|---|---|---|---|
Priorité: | Normal | Echéance: | ||
Assigné à: | - | % réalisé: | 0% |
|
Catégorie: | Stats / R | Temps passé: | - | |
Version cible: | TXM 0.X.X |
Description
We can add structures (text, p, s, etc.) start-end positions to the R object (Dataframe) created.
Solution :
- add a 'struct' vector of structure vectors
- add a vector per text structure in the 'struct vector of name 'text'
- add a 'lex' vector of lexicons (lexicons must loose their 'lex' string in their name)
- each vector element is a vector of [start, end] Integer positions
- position values start at 0 for the first word of a corpus and a sub-corpus
- priority is for the 'text' structure element to export #870
'text' structure is already done except for 4) and 5) see ticket #870