Task #2997

Vocapia2Transcriber, Vocapia to Transcriber conversion macro

Added by Matthieu Decorde about 1 month ago. Updated about 1 month ago.

Status:New Start date:01/19/2021
Priority:Normal Due date:
Assignee:- % Done:80%

Category:Import Spent time: -
Target version:TXM 0.8.2 - 13NOV 1.0

Description

Location : "transcription" macro directory

Parameters
  • vocapia file : process only one file
  • vocapia directory : process the XML files of the directory
  • result directory
  • retokenize_words : true (false to keep vocapia tokenization)

Conversion rules
  • Word -> w
    • @stime -> @time
    • @stime -> @start
    • @etime -> @end
    • all other attributes are transferred (conf, dur...)
    • fix tokenization for TXM: "j'ai" -> "j'" "ai" (see #3004). A quick tokenization fix raises the same problems as tokenizing text, so it is better to implement the re-tokenize import option
  • AudioDoc -> Trans
  • SpeakerList -> Speakers
  • SegmentList -> Episode + Section
  • SpeechSegment -> Turn
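
The renaming rules above can be sketched as follows. This is only an illustration in Python using the standard xml.etree.ElementTree module, not the macro's actual code. It makes several assumptions: the two @stime rules are read literally (the value is copied to both @time and @start), Section is nested inside Episode, attributes of SegmentList are dropped, and the function names and the sample attribute spkid are hypothetical. Re-tokenization is not covered.

```python
import xml.etree.ElementTree as ET

# Element renames taken from the conversion rules listed above.
ELEMENT_MAP = {
    "Word": "w",
    "AudioDoc": "Trans",
    "SpeakerList": "Speakers",
    "SpeechSegment": "Turn",
}


def convert_word_attributes(attrs):
    """Map Word attributes: @stime -> @time and @start, @etime -> @end.

    Every other attribute (conf, dur...) is copied unchanged.
    Assumption: @stime feeds both @time and @start.
    """
    out = {}
    for name, value in attrs.items():
        if name == "stime":
            out["time"] = value
            out["start"] = value
        elif name == "etime":
            out["end"] = value
        else:
            out[name] = value
    return out


def convert(elem):
    """Return a renamed copy of elem and, recursively, of its children."""
    if elem.tag == "SegmentList":
        # SegmentList -> Episode + Section (nesting is an assumption;
        # SegmentList attributes are not carried over in this sketch).
        episode = ET.Element("Episode")
        section = ET.SubElement(episode, "Section")
        for child in elem:
            section.append(convert(child))
        episode.tail = elem.tail
        return episode

    new = ET.Element(ELEMENT_MAP.get(elem.tag, elem.tag))
    if elem.tag == "Word":
        new.attrib.update(convert_word_attributes(elem.attrib))
    else:
        new.attrib.update(elem.attrib)
    new.text = elem.text
    new.tail = elem.tail
    for child in elem:
        new.append(convert(child))
    return new
```

A caller would parse a source file with ET.parse(path).getroot(), pass the root to convert(), and write the result with ET.ElementTree(result).write(out_path, encoding="utf-8").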

History

#1 Updated by Matthieu Decorde about 1 month ago

  • Subject changed from Import, Transcriber, show all word properties in edition word titles to Vocapia to Transcriber conversion macro

#2 Updated by Serge Heiden about 1 month ago

  • Description updated (diff)

#3 Updated by Matthieu Decorde about 1 month ago

  • Subject changed from Vocapia to Transcriber conversion macro to Vocapia2Transcriber, Vocapia to Transcriber conversion macro
  • Description updated (diff)
  • % Done changed from 0 to 80

#4 Updated by Matthieu Decorde about 1 month ago

  • Description updated (diff)
  • Target version changed from TXM - 13NOV 1.0 to TXM 0.8.2 - 13NOV 1.0

#5 Updated by Matthieu Decorde about 1 month ago

  • Description updated (diff)

#6 Updated by Matthieu Decorde about 1 month ago

  • Description updated (diff)
