Task #2997

Updated by Matthieu Decorde more than 4 years ago

Location : "transcription" macro directory

Parameters
* vocapia file : process only one file
* vocapia directory : process the XML files of the directory
* result directory
* retokenize_words : true (false to keep vocapia tokenization)
Conversion rules :
* Word -> w
** @stime -> @time
** @stime -> @start
** @etime -> @end
** all other attributes are transferred (conf, dur...)
** fix tokenization for TXM when retokenize_words is true: "j'ai" -> "j'" "ai"
* AudioDoc -> Trans
* SpeakerList -> Speakers
* SegmentList -> Episode + Section
* SpeechSegment -> Turn
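
The rules above can be sketched in Python as follows. This is only an illustration, not the macro's actual code: the function names (`split_elision`, `convert_word_attributes`, `convert_word`) are hypothetical, the sample `Word` element is made up, and the sketch assumes that a split word's pieces all inherit the original attributes. The `SegmentList -> Episode + Section` rule is left out because the task does not say how the two elements are nested.

```python
import xml.etree.ElementTree as ET

# One-to-one element renames taken from the conversion rules.
# SegmentList is omitted: its Episode + Section nesting is not specified.
ELEMENT_MAP = {
    "Word": "w",
    "AudioDoc": "Trans",
    "SpeakerList": "Speakers",
    "SpeechSegment": "Turn",
}


def split_elision(form):
    """Split an elided form at its first apostrophe: "j'ai" -> ["j'", "ai"]."""
    head, sep, tail = form.partition("'")
    if sep and head and tail:
        return [head + sep, tail]
    return [form]


def convert_word_attributes(attrs):
    """Rename the timing attributes and copy every other attribute unchanged."""
    out = {}
    for name, value in attrs.items():
        if name == "stime":
            # The rules list both @time and @start as targets of @stime.
            out["time"] = value
            out["start"] = value
        elif name == "etime":
            out["end"] = value
        else:
            out[name] = value  # conf, dur, ...
    return out


def convert_word(word, retokenize_words=True):
    """Turn one Word element into one or more w elements."""
    form = (word.text or "").strip()
    pieces = split_elision(form) if retokenize_words else [form]
    result = []
    for piece in pieces:
        # Assumption: every piece keeps the attributes of the source Word.
        w = ET.Element(ELEMENT_MAP["Word"], convert_word_attributes(word.attrib))
        w.text = piece
        result.append(w)
    return result


# Made-up input, for illustration only.
sample = ET.fromstring('<Word stime="1.20" etime="1.45" conf="0.93">j\'ai</Word>')
for w in convert_word(sample):
    print(ET.tostring(w, encoding="unicode"))
```

Calling `convert_word(sample, retokenize_words=False)` keeps the original Vocapia token as a single `w` element. Splitting at the first apostrophe is deliberately naive: it would also split forms such as "aujourd'hui", which a real implementation would probably need to protect.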
