Task #2998

13NOV, prepare transcription macro V1

Added by Matthieu Decorde about 1 month ago. Updated 2 days ago.

Status:New Start date:01/19/2021
Priority:Normal Due date:
Assignee:- % Done:

80%

Category:Import Spent time: -
Target version:TXM - 13NOV 1.0

Description

Transcriptions fixes:
  • add the otherNonPrimaryLocutor parameter -> used to create the other turn of the primary locutor
  • convert asterisked word sequences to turns ("*abc ... xyz*") with:
    • @who=interviewee code OR other code
      • if the spk does not matches the primarySpeakerIdRegex parameter, then @who must be set with the primary speaker id
      • primary speaker id <- first speaker ID matching the primarySpeakerIdRegex regex
    • @orig-who=<locutor of the original containing turn>
  • split tokens ending with punctuations ("abc,") -> #3004
  • XXX -> event ponctuel dont desc = XXX
  • " -> "rapp1" & "rapp2" events

Fix-set the tokenizer to manage:

réintroduire les règles que nous appliquons usuellement dans TXM, notamment les cas particuliers sur les apostrophes (aujourd'hui, quelqu'un...), et les tirets d'interrogations/exclamations (est-ce, est-il, avez-vous, peut-on, semble-t-il, allez-y, excusez-moi) ou autres (mois-ci, moment-là, moi-même, nous-mêmes, jour-même).

Solution

see macros projects/13nov/FixTranscriptions

History

#1 Updated by Serge Heiden about 1 month ago

  • Description updated (diff)

#2 Updated by Serge Heiden about 1 month ago

  • Description updated (diff)

#3 Updated by Serge Heiden about 1 month ago

  • Description updated (diff)

#4 Updated by Matthieu Decorde about 1 month ago

  • Subject changed from 13NOV fix macro to 13NOV fix transcription macro
  • Description updated (diff)

#5 Updated by Matthieu Decorde about 1 month ago

  • Description updated (diff)

#6 Updated by Matthieu Decorde about 1 month ago

test

#7 Updated by Matthieu Decorde about 1 month ago

  • Description updated (diff)

#8 Updated by Matthieu Decorde 24 days ago

  • Description updated (diff)

#9 Updated by Matthieu Decorde 24 days ago

  • % Done changed from 0 to 50

all is implemented except the rapp1&2 events

#10 Updated by Matthieu Decorde 20 days ago

  • Description updated (diff)
  • % Done changed from 50 to 30

#11 Updated by Matthieu Decorde 20 days ago

  • Description updated (diff)

#12 Updated by Matthieu Decorde 20 days ago

  • Description updated (diff)

#13 Updated by Matthieu Decorde 17 days ago

  • Description updated (diff)
  • % Done changed from 30 to 20

#14 Updated by Matthieu Decorde 13 days ago

  • Description updated (diff)

#15 Updated by Matthieu Decorde 13 days ago

  • Subject changed from 13NOV fix transcription macro to 13NOV, prepare transcription macro V1

#16 Updated by Matthieu Decorde 2 days ago

  • % Done changed from 20 to 80

done r3030

Also available in: Atom PDF