Bug #3270

XML TS Import fails

Added by Alexey Lavrentev 2 months ago. Updated 24 days ago.

Status:New Start date:07/29/2022
Priority:Normal Due date:
Assignee:- % Done:

80%

Category:Import Spent time: -
Target version:TXM 0.8.2

Description

TXM 0.8.2 0.8.2.202206201458
Syntatic Annotation 1.0.0.202206290953

Tested on PROFITEROLE-GOLD-V1-0 tiger-xml files (Sharedocs Cactus/Projets/Profiterole/corpus/PROFITEROLE-GOLD-V1-0_tiger-xml.zip)

Sauvegarde des paramètres d'importation…
Compiling tigersearch import module...
Démarrage du module d'import "tigersearch"...
Import du corpus...
-- IMPORTER - Reading source files
Using [/home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/Lapidfp-00001.xml, /home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/jehpar_gold-00001.xml, /home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/beroul-00001.xml, /home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/grchron_j2c5_gold-00001.xml, /home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/commyn1_gold-00001.xml, /home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/YvainKu-00001.xml, /home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/roland-00001.xml, /home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/strasbBfm-00001.xml, /home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/slethgier-00001.xml, /home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/AlexisRaM-00001.xml, /home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/aucassin-00001.xml, /home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/AlexisProlRaM-00001.xml, /home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/qgraal_cm-00001.xml, /home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/qlr-00001.xml] as TIGER XML source files.
TIGER-XML files: [/home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/Lapidfp-00001.xml, /home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/jehpar_gold-00001.xml, /home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/beroul-00001.xml, /home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/grchron_j2c5_gold-00001.xml, /home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/commyn1_gold-00001.xml, /home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/YvainKu-00001.xml, /home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/roland-00001.xml, /home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/strasbBfm-00001.xml, /home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/slethgier-00001.xml, /home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/AlexisRaM-00001.xml, /home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/aucassin-00001.xml, /home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/AlexisProlRaM-00001.xml, /home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/qgraal_cm-00001.xml, /home/alavrent/Documents/Projets/ANR - Profiterole/corpus/gold/tiger-xml/qlr-00001.xml]
-- Applying '/home/alavrent/TXM-0.8.2/xsl/ts.xsl' XSL to 14 files from  directory '/home/alavrent/TXM-0.8.2/corpora/PROFITEROLE-GOLD-TIGER-XML/src' with parameters: {output-directory=file:/home/alavrent/TXM-0.8.2/corpora/PROFITEROLE-GOLD-TIGER-XML/tokenized/} result written in '/home/alavrent/TXM-0.8.2/corpora/PROFITEROLE-GOLD-TIGER-XML/tokenized'
 14 .........1....
-- Building (14 XML-TXM files)
 14 .........1....
-- COMPILING - Building Search Engine indexes
-- Scanning structures&properties to create for 14 texts...
 14 .........1....
-- Building CQP files 14/14...
 14 .........1....
-- Running cwb-encode...
 Word properties: id, mor, textid, pos, editionId, lemma
 Structures: p:0+n, s:0+id+n, text:0+id+base+project+name, txmcorpus:0+lang
 14 .........1....
-- Running cwb-makeall...
-- EDITION - Building editions
-- Building 'default' edition of 14/14 texts...
 14 .........1....
--- Copying subdirectories [xsl, css, dtd, doc]

Building TIGER driver file: /home/alavrent/TXM-0.8.2/corpora/PROFITEROLE-GOLD-TIGER-XML/src/corpus.xml...
Feature values to skiip when indexing corpus: []
Import failed.
Corpus "PROFITEROLE-GOLD-TIGER-XML" supprimé(e).

History

#1 Updated by Matthieu Decorde 24 days ago

  • % Done changed from 0 to 80

the subcorpus tags were malformed

Also available in: Atom PDF