Feature #1031
RCP: Add structure start positions to the R object produced by the Corpus SendToR command
Status: | New | Start date: | 10/03/2014
---|---|---|---
Priority: | Normal | Due date: |
Assignee: | - | % Done: | 0%
Category: | Stats / R | Spent time: | -
Target version: | TXM X.X | |
Description
We can add the start and end positions of structures (text, p, s, etc.) to the R object (data frame) created by the command.
Solution:
1. add a 'struct' vector of structure vectors
2. within the 'struct' vector, add an element named 'text' holding one vector per text structure
3. add a 'lex' vector of lexicons (lexicons must lose the 'lex' string in their name)
4. each vector element is a vector of [start, end] Integer positions
5. position values start at 0 for the first word of a corpus or a sub-corpus
6. priority is to export the 'text' structure element (see #870)
The 'text' structure is already done except for 4) and 5); see ticket #870.
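
The proposed layout can be sketched as follows. This is only an illustration in Python of the shape the R object might take, not the actual SendToR implementation. The function names (`rebase`, `build_export`), the `corpus_start` parameter, the sample positions, and the sample lexicon `wordlex` are all hypothetical. Removing the substring 'lex' from lexicon names is one possible reading of the naming rule above, and the ticket does not say whether the end position is inclusive, so the values are kept as given.

```python
from typing import Any, Dict, List, Tuple

# A structure span as (start, end) word positions in the full corpus.
Span = Tuple[int, int]


def rebase(spans: List[Span], corpus_start: int) -> List[List[int]]:
    """Shift positions so the first word of the (sub-)corpus is 0."""
    return [[start - corpus_start, end - corpus_start] for start, end in spans]


def build_export(
    structures: Dict[str, List[Span]],
    lexicons: Dict[str, Any],
    corpus_start: int = 0,
) -> Dict[str, Dict[str, Any]]:
    """Build a nested mapping mirroring the proposed 'struct' and 'lex' vectors."""
    return {
        # One entry per structure name, each a list of [start, end] pairs.
        "struct": {
            name: rebase(spans, corpus_start)
            for name, spans in structures.items()
        },
        # Assumption: the 'lex' marker is simply removed from each name.
        "lex": {
            name.replace("lex", ""): lexicon
            for name, lexicon in lexicons.items()
        },
    }


# Made-up sub-corpus starting at word 100 of its parent corpus.
example = build_export(
    structures={
        "text": [(100, 149), (150, 199)],
        "p": [(100, 120), (121, 149), (150, 199)],
    },
    lexicons={"wordlex": {"the": 12, "corpus": 3}},
    corpus_start=100,
)
print(example["struct"]["text"])
```

In R, the same shape would presumably be a named list (`struct$text`, `struct$p`, `lex$word`) whose elements are two-element integer vectors.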