Feature #1031

RCP: Add structures start positions to the R object produced by the Corpus SendToR command

Ajouté par Matthieu Decorde il y a plus de 4 ans.

Statut:New Début:03/10/2014
Priorité:Normal Echéance:
Assigné à:- % réalisé:

0%

Catégorie:Stats / R Temps passé: -
Version cible:TXM X.X

Description

We can add structures (text, p, s, etc.) start-end positions to the R object (Dataframe) created.

Solution :
- add a 'struct' vector of structure vectors
- add a vector per text structure in the 'struct vector of name 'text'
- add a 'lex' vector of lexicons (lexicons must loose their 'lex' string in their name)
- each vector element is a vector of [start, end] Integer positions
- position values start at 0 for the first word of a corpus and a sub-corpus
- priority is for the 'text' structure element to export #870

'text' structure is already done except for 4) and 5) see ticket #870

Formats disponibles : Atom PDF