Feature #1031

RCP: Add structures start positions to the R object produced by the Corpus SendToR command

Added by Matthieu Decorde over 8 years ago.

Status:New Start date:10/03/2014
Priority:Normal Due date:
Assignee:- % Done:

0%

Category:Stats / R Spent time: -
Target version:TXM X.X

Description

We can add structures (text, p, s, etc.) start-end positions to the R object (Dataframe) created.

Solution :
- add a 'struct' vector of structure vectors
- add a vector per text structure in the 'struct vector of name 'text'
- add a 'lex' vector of lexicons (lexicons must loose their 'lex' string in their name)
- each vector element is a vector of [start, end] Integer positions
- position values start at 0 for the first word of a corpus and a sub-corpus
- priority is for the 'text' structure element to export #870

'text' structure is already done except for 4) and 5) see ticket #870

Also available in: Atom PDF