Task #2997
Vocapia2Transcriber, Vocapia to Transcriber conversion macro
Status: | New | Start date: | 01/19/2021
---|---|---|---
Priority: | Normal | Due date: |
Assignee: | - | % Done: | 80%
Category: | Import | Spent time: | -
Target version: | TXM 0.8.2 - 13NOV 1.0 | |
Description
Location: "transcription" macro directory
Parameters:
- vocapia file: process only one file
- vocapia directory: process the XML files of the directory
- result directory
- retokenize_words: true (false to keep the Vocapia tokenization)
Conversion rules:
- Word -> w
  - @stime -> @time
  - @stime -> @start
  - @etime -> @end
  - all other attributes are transferred (conf, dur...)
  - fix tokenization for TXM: "j'ai" -> "j'" "ai" (see #3004). A quick fix of the tokenization raises the same problems as tokenizing text, so it is better to implement the re-tokenize import option.
- AudioDoc -> Trans
- SpeakerList -> Speakers
- SegmentList -> Episode + Section
- SpeechSegment -> Turn
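The conversion rules above can be sketched as follows. This is only an illustration in Python using the standard xml.etree.ElementTree module, not the macro's actual code. The names TAG_MAP, ATTR_MAP, convert and split_apostrophe are hypothetical. The sketch assumes that the attribute renames apply to Word elements only and that "Episode + Section" means a Section nested inside an Episode. The sample input is made up.

```python
import re
import xml.etree.ElementTree as ET

# Element renames taken from the conversion rules.
TAG_MAP = {
    "Word": "w",
    "AudioDoc": "Trans",
    "SpeakerList": "Speakers",
    "SpeechSegment": "Turn",
}

# Attribute renames for Word (assumption: they apply to Word only).
# @stime is copied to both @time and @start, as listed in the rules.
ATTR_MAP = {"stime": ("time", "start"), "etime": ("end",)}


def convert(elem):
    """Recursively build the renamed counterpart of a Vocapia element."""
    if elem.tag == "SegmentList":
        # Assumption: SegmentList becomes a Section wrapped in an Episode.
        out = ET.Element("Episode")
        target = ET.SubElement(out, "Section")
    else:
        out = target = ET.Element(TAG_MAP.get(elem.tag, elem.tag))

    if elem.tag == "Word":
        for name, value in elem.attrib.items():
            # Attributes without a rename (conf, dur...) are kept as they are.
            for new_name in ATTR_MAP.get(name, (name,)):
                target.set(new_name, value)
    else:
        target.attrib.update(elem.attrib)

    target.text = elem.text
    for child in elem:
        target.append(convert(child))
    out.tail = elem.tail
    return out


def split_apostrophe(form):
    """Naive re-tokenization: cut a word form after each apostrophe."""
    return re.findall(r"[^'’]+['’]|[^'’]+", form)


if __name__ == "__main__":
    sample = (
        '<AudioDoc><SpeakerList/><SegmentList><SpeechSegment>'
        '<Word stime="1.0" etime="1.5" conf="0.9">j\'ai</Word>'
        '</SpeechSegment></SegmentList></AudioDoc>'
    )
    result = convert(ET.fromstring(sample))
    print(ET.tostring(result, encoding="unicode"))
    print(split_apostrophe("j'ai"))
```

Note that the naive split also cuts "aujourd'hui" into "aujourd'" and "hui", which illustrates why a quick fix runs into the same problems as tokenizing text and why reusing the import re-tokenization option is preferable.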
History
#1 Updated by Matthieu Decorde over 2 years ago
- Subject changed from Import, Transcriber, show all word properties in edition word titles to Vocapia to Transcriber conversion macro
#2 Updated by Serge Heiden over 2 years ago
- Description updated (diff)
#3 Updated by Matthieu Decorde over 2 years ago
- Subject changed from Vocapia to Transcriber conversion macro to Vocapia2Transcriber, Vocapia to Transcriber conversion macro
- Description updated (diff)
- % Done changed from 0 to 80
#4 Updated by Matthieu Decorde over 2 years ago
- Description updated (diff)
- Target version changed from TXM - 13NOV 1.0 to TXM 0.8.2 - 13NOV 1.0
#5 Updated by Matthieu Decorde over 2 years ago
- Description updated (diff)
#6 Updated by Matthieu Decorde over 2 years ago
- Description updated (diff)