Bug #2793
XTZ, XML/w and Transcriber Import, XLSX and ODS metadata files not supported when XSL directory is present
| Statut: | Closed | Début: | 08/04/2020 | |
|---|---|---|---|---|
| Priorité: | Urgent | Echéance: | ||
| Assigné à: | - | % réalisé: | 100% |
|
| Catégorie: | Import | Temps passé: | - | |
| Version cible: | TXM 0.8.2 |
Description
This is due to the fact that TXM tries to run XSLT transformations on XSLX and ODS files. The module works properly with metadata.csv.
Démarrage de TXM 0.8.0.2221 (2019-08-30 14h42)…
TXM est prêt.
The JOUBERTXTZOG corpus will be created from the /home/alavrent/Bureau/Ex5/joubert-xtz-og directory.
The 'annotate' import parameter has been activated since TreeTagger is installed.
Sauvegarde des paramètres d'importation…
Démarrage du script d'import Groovy xtzLoader.groovy.
[[id, auteur, titre, extrait, date, ville], [joubert1579_1-02, Joubert, Laurent, Erreurs populaires, Livre 1, ch. 2, 1579, Bordeaux], [joubert1587_1-02, Joubert, Laurent, Erreurs populaires, Livre 1, ch. 2, 1587, Paris]]
-- Split-Merge XSL Step with /home/alavrent/Bureau/Ex5/joubert-xtz-og/xsl/1-split-merge
-- Front XSL Step with the /home/alavrent/Bureau/Ex5/joubert-xtz-og/xsl/2-front directory.
ApplyXsl2 with the /home/alavrent/Bureau/Ex5/joubert-xtz-og/xsl/2-front/01-txm-front-teip5-og-xtz-joubert-removeAncor.xsl stylesheet.
-- Applying /home/alavrent/Bureau/Ex5/joubert-xtz-og/xsl/2-front/01-txm-front-teip5-og-xtz-joubert-removeAncor.xsl XSL to 3 (from /home/alavrent/TXM-0.8.0/corpora/JOUBERTXTZOG/src) files with parameters: {output-directory=file:/home/alavrent/TXM-0.8.0/corpora/JOUBERTXTZOG/txm/JOUBERTXTZOG/} on directory /home/alavrent/TXM-0.8.0/corpora/JOUBERTXTZOG/src result written in /home/alavrent/TXM-0.8.0/corpora/JOUBERTXTZOG/txm/JOUBERTXTZOG
003 .Error on line 1 column 1 of metadata.xlsx:
SXXP0003: Error reported by XML parser: Contenu non autorisé dans le prologue.
net.sf.saxon.trans.XPathException: org.xml.sax.SAXParseException; systemId: file:/home/alavrent/TXM-0.8.0/corpora/JOUBERTXTZOG/src/metadata.xlsx; lineNumber: 1; columnNumber: 1; Contenu non autorisé dans le prologue.
at net.sf.saxon.event.Sender.sendSAXSource(Sender.java:425)
at net.sf.saxon.event.Sender.send(Sender.java:178)
at net.sf.saxon.Controller.transform(Controller.java:1790)
at org.txm.importer.ApplyXsl2.process(ApplyXsl2.java:304)
at org.txm.importer.ApplyXsl2.processImportSources(ApplyXsl2.java:437)
at org.txm.importer.ApplyXsl2.processImportSources(ApplyXsl2.java:363)
at org.txm.importer.ApplyXsl2$processImportSources.call(Unknown Source)
...
at org.txm.core.engines.ScriptedImportEngine.build(ScriptedImportEngine.java:56)
at org.txm.objects.Project._compute(Project.java:320)
at org.txm.core.results.TXMResult.compute(TXMResult.java:2224)
at org.txm.core.results.TXMResult.compute(TXMResult.java:2143)
at org.txm.rcp.handlers.scripts.ExecuteImportScript$2.run(ExecuteImportScript.java:146)
at org.eclipse.core.internal.jobs.Worker.run(Worker.java:56)
Caused by: org.xml.sax.SAXParseException; systemId: file:/home/alavrent/TXM-0.8.0/corpora/JOUBERTXTZOG/src/metadata.xlsx; lineNumber: 1; columnNumber: 1; Contenu non autorisé dans le prologue.
at com.sun.org.apache.xerces.internal.util.ErrorHandlerWrapper.createSAXParseException(ErrorHandlerWrapper.java:203)
at com.sun.org.apache.xerces.internal.util.ErrorHandlerWrapper.fatalError(ErrorHandlerWrapper.java:177)
...
at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.parse(AbstractSAXParser.java:1213)
at com.sun.org.apache.xerces.internal.jaxp.SAXParserImpl$JAXPSAXParser.parse(SAXParserImpl.java:643)
at net.sf.saxon.event.Sender.sendSAXSource(Sender.java:405)
... 37 more
---------
org.xml.sax.SAXParseException; systemId: file:/home/alavrent/TXM-0.8.0/corpora/JOUBERTXTZOG/src/metadata.xlsx; lineNumber: 1; columnNumber: 1; Contenu non autorisé dans le prologue.
at com.sun.org.apache.xerces.internal.util.ErrorHandlerWrapper.createSAXParseException(ErrorHandlerWrapper.java:203)
at com.sun.org.apache.xerces.internal.util.ErrorHandlerWrapper.fatalError(ErrorHandlerWrapper.java:177)
at com.sun.org.apache.xerces.internal.impl.XMLErrorReporter.reportError(XMLErrorReporter.java:400)
...
at org.txm.scripts.importer.xtz.xtzLoader.run(xtzLoader.groovy:58)
at groovy.util.GroovyScriptEngine.run(GroovyScriptEngine.java:599)
at org.txm.groovy.core.GroovyScriptedImportEngine._build(GroovyScriptedImportEngine.java:123)
at org.txm.core.engines.ScriptedImportEngine.build(ScriptedImportEngine.java:56)
at org.txm.objects.Project._compute(Project.java:320)
at org.txm.core.results.TXMResult.compute(TXMResult.java:2224)
at org.txm.core.results.TXMResult.compute(TXMResult.java:2143)
at org.txm.rcp.handlers.scripts.ExecuteImportScript$2.run(ExecuteImportScript.java:146)
at org.eclipse.core.internal.jobs.Worker.run(Worker.java:56)
ODS :
Sauvegarde des paramètres d'importation…
Démarrage du script d'import Groovy xtzLoader.groovy.
Warning: the 7the column name is empty
-- Split-Merge XSL Step with /home/alavrent/Bureau/Ex5/joubert-xtz-og/xsl/1-split-merge
-- Front XSL Step with the /home/alavrent/Bureau/Ex5/joubert-xtz-og/xsl/2-front directory.
ApplyXsl2 with the /home/alavrent/Bureau/Ex5/joubert-xtz-og/xsl/2-front/01-txm-front-teip5-og-xtz-joubert-removeAncor.xsl stylesheet.
-- Applying /home/alavrent/Bureau/Ex5/joubert-xtz-og/xsl/2-front/01-txm-front-teip5-og-xtz-joubert-removeAncor.xsl XSL to 3 (from /home/alavrent/TXM-0.8.0/corpora/JOUBERTXTZOG/src) files with parameters: {output-directory=file:/home/alavrent/TXM-0.8.0/corpora/JOUBERTXTZOG/txm/JOUBERTXTZOG/} on directory /home/alavrent/TXM-0.8.0/corpora/JOUBERTXTZOG/src result written in /home/alavrent/TXM-0.8.0/corpora/JOUBERTXTZOG/txm/JOUBERTXTZOG
003 ...Error on line 1 column 1 of metadata.ods:
SXXP0003: Error reported by XML parser: Contenu non autorisé dans le prologue.
net.sf.saxon.trans.XPathException: org.xml.sax.SAXParseException; systemId: file:/home/alavrent/TXM-0.8.0/corpora/JOUBERTXTZOG/src/metadata.ods; lineNumber: 1; columnNumber: 1; Contenu non autorisé dans le prologue.
at net.sf.saxon.event.Sender.sendSAXSource(Sender.java:425)
at net.sf.saxon.event.Sender.send(Sender.java:178)
at net.sf.saxon.Controller.transform(Controller.java:1790)
at org.txm.importer.ApplyXsl2.process(ApplyXsl2.java:304)
at org.txm.importer.ApplyXsl2.processImportSources(ApplyXsl2.java:437)
at org.txm.importer.ApplyXsl2.processImportSources(ApplyXsl2.java:363)
...
at org.txm.scripts.importer.xtz.XTZImport.start(XTZImport.groovy:86)
at org.txm.importer.xtz.ImportModule.process(ImportModule.java:242)
at org.txm.importer.xtz.ImportModule$process$2.call(Unknown Source)
at org.txm.scripts.importer.xtz.xtzLoader.run(xtzLoader.groovy:58)
at groovy.util.GroovyScriptEngine.run(GroovyScriptEngine.java:599)
at org.txm.groovy.core.GroovyScriptedImportEngine._build(GroovyScriptedImportEngine.java:123)
at org.txm.core.engines.ScriptedImportEngine.build(ScriptedImportEngine.java:56)
at org.txm.objects.Project._compute(Project.java:320)
at org.txm.core.results.TXMResult.compute(TXMResult.java:2224)
at org.txm.core.results.TXMResult.compute(TXMResult.java:2143)
at org.txm.rcp.handlers.scripts.ExecuteImportScript$2.run(ExecuteImportScript.java:146)
at org.eclipse.core.internal.jobs.Worker.run(Worker.java:56)
Caused by: org.xml.sax.SAXParseException; systemId: file:/home/alavrent/TXM-0.8.0/corpora/JOUBERTXTZOG/src/metadata.ods; lineNumber: 1; columnNumber: 1; Contenu non autorisé dans le prologue.
at com.sun.org.apache.xerces.internal.util.ErrorHandlerWrapper.createSAXParseException(ErrorHandlerWrapper.java:203)
at com.sun.org.apache.xerces.internal.util.ErrorHandlerWrapper.fatalError(ErrorHandlerWrapper.java:177)
at com.sun.org.apache.xerces.internal.impl.XMLErrorReporter.reportError(XMLErrorReporter.java:400)
...
at com.sun.org.apache.xerces.internal.parsers.XMLParser.parse(XMLParser.java:141)
at com.sun.org.apache.xerces.internal.parsers.AbstractSAXParser.parse(AbstractSAXParser.java:1213)
at com.sun.org.apache.xerces.internal.jaxp.SAXParserImpl$JAXPSAXParser.parse(SAXParserImpl.java:643)
at net.sf.saxon.event.Sender.sendSAXSource(Sender.java:405)
... 31 more
---------
org.xml.sax.SAXParseException; systemId: file:/home/alavrent/TXM-0.8.0/corpora/JOUBERTXTZOG/src/metadata.ods; lineNumber: 1; columnNumber: 1; Contenu non autorisé dans le prologue.
at com.sun.org.apache.xerces.internal.util.ErrorHandlerWrapper.createSAXParseException(ErrorHandlerWrapper.java:203)
at com.sun.org.apache.xerces.internal.util.ErrorHandlerWrapper.fatalError(ErrorHandlerWrapper.java:177)
a...
at org.txm.scripts.importer.xtz.XTZImport.start(XTZImport.groovy:86)
at org.txm.importer.xtz.ImportModule.process(ImportModule.java:242)
at org.txm.importer.xtz.ImportModule$process$2.call(Unknown Source)
at org.txm.scripts.importer.xtz.xtzLoader.run(xtzLoader.groovy:58)
at groovy.util.GroovyScriptEngine.run(GroovyScriptEngine.java:599)
at org.txm.groovy.core.GroovyScriptedImportEngine._build(GroovyScriptedImportEngine.java:123)
at org.txm.core.engines.ScriptedImportEngine.build(ScriptedImportEngine.java:56)
at org.txm.objects.Project._compute(Project.java:320)
at org.txm.core.results.TXMResult.compute(TXMResult.java:2224)
at org.txm.core.results.TXMResult.compute(TXMResult.java:2143)
at org.txm.rcp.handlers.scripts.ExecuteImportScript$2.run(ExecuteImportScript.java:146)
at org.eclipse.core.internal.jobs.Worker.run(Worker.java:56)
Solution¶
update ApplyXSL file filters and fix data source selection in XML/w and XTZ import module
Validation test¶
- download attached archive: xslodsmetadata.zip
- import with XML/w and XTZ
- the import don't fail and only one text "t1" is present
Révisions associées
fix xlsx extension filtering refs #2793
Historique
#1 Mis à jour par Matthieu Decorde il y a plus de 5 ans
- Catégorie mis à Import
- Version cible changé de TXM 0.8.2 à TXM 0.8.1
#2 Mis à jour par Matthieu Decorde il y a plus de 5 ans
- Sujet changé de RCP: 0.8.0, XTZ Import, XLSX and ODS metadata files not supported when XSL directory present à 0.8.0, XTZ Import, XLSX and ODS metadata files not supported when XSL directory is present
#3 Mis à jour par Matthieu Decorde il y a plus de 5 ans
- % réalisé changé de 0 à 80
- Fichier xslodsmetadata.zip ajouté
- Description mis à jour (diff)
#4 Mis à jour par Serge Heiden il y a plus de 4 ans
- Sujet changé de 0.8.0, XTZ Import, XLSX and ODS metadata files not supported when XSL directory is present à XTZ, XML/w and Transcriber Import, XLSX and ODS metadata files not supported when XSL directory is present
- Priorité changé de Normal à Urgent
- Version cible changé de TXM 0.8.1 à TXM 0.8.2
- % réalisé changé de 80 à 0
Bug still present in Transcriber import module.
#5 Mis à jour par Matthieu Decorde il y a plus de 4 ans
- Fichier xslodsmetadata.zip ajouté
- % réalisé changé de 0 à 80
update attached test directory
#6 Mis à jour par Sebastien Jacquot il y a presque 2 ans
- % réalisé changé de 80 à 100
#7 Mis à jour par Sebastien Jacquot il y a presque 2 ans
- Statut changé de New à Closed