Bug #3230

Import, lang detection fails on small files

Added by Matthieu Decorde over 1 year ago. Updated 4 months ago.

Status:New Start date:02/22/2022
Priority:Normal Due date:
Assignee:- % Done:

80%

Category:Import Spent time: -
Target version:TXM 0.8.3

Description

Error message:

Args:
.TXM-0.8.2\configuration\org.eclipse.osgi\98\data\-1285204199\plugins\org.txm.treetagger.core.win32_1.0.0.202201251609\res\win32/bin/tree-tagger -token -lemma -sgml -no-unknown -cap-heuristics -quiet -eos-tag <s> TXM-0.8.2\treetagger-models\??.par TXM-0.8.2\corpora\POEMES-DE-QUI\ptreetagger\Alexandrins.xml-src.tt TXM-0.8.2\corpora\POEMES-DE-QUI\treetagger\Alexandrins.xml-out.tt
java.io.FileNotFoundException: TXM-0.8.2\corpora\POEMES-DE-QUI\treetagger\Alexandrins.xml-out.tt (Le fichier spécifié est introuvable)
    at java.io.FileInputStream.open0(Native Method)
    at java.io.FileInputStream.open(Unknown Source)
    at java.io.FileInputStream.<init>(Unknown Source) 

Solution

When the input file is too small, no detection is done and the model was not set. We now use the defautl locale lang

History

#1 Updated by Matthieu Decorde over 1 year ago

  • % Done changed from 0 to 80

#2 Updated by Matthieu Decorde 4 months ago

  • Target version changed from TXM 0.8.2 to TXM 0.8.3

Also available in: Atom PDF