Support #1049

KF: Corpus Mariage pour tous: Import or Sub-corpus

Added by Serge Heiden about 5 years ago. Updated about 5 years ago.

Status: New
Start date: 10/09/2014
Priority: Normal
Due date:
Assignee: -
% Done: 0%
Category: Import
Spent time: -
Target version: Support

Description

From KF:

Problem with the 'Mariage pour tous' corpus.
Import > XML TEI TXM > sub-corpus creation impossible, but no obvious bug
Import > XML/w + CSV > sub-corpus creation possible
Tested on the MPT (Mariage pour tous) corpus

On the 'Mariage pour tous' corpus web site (Nicolas Legrand): https://github.com/nlegrand/mariagepourtousInXML

We find two corpora:

Diagnostic 1

Test with binary version:
  • File / Load MPT-TXM_2013-03-20 -> new 'MPT' corpus
  • Sub-corpus structure=metadata, property=debat, value=mpt -> new sub-corpus
  • Lexicon on sub-corpus works -> '24730 items pour 946286 occurrences.' (24730 items for 946286 occurrences)
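To make the three steps above concrete, here is a minimal sketch of what selecting a sub-corpus by a metadata property and then computing a lexicon amounts to. It does not use TXM's API: the `Text` class, the `lexicon` and `summary` methods, and the sample data are all hypothetical and only illustrate the "items for occurrences" count reported by the Lexicon command.

```java
import java.util.Arrays;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

class SubcorpusLexiconSketch {

    // Hypothetical stand-in for a text unit carrying a 'debat' metadata property.
    static final class Text {
        final String debat;
        final List<String> words;

        Text(String debat, String... words) {
            this.debat = debat;
            this.words = Arrays.asList(words);
        }
    }

    // Toy data (not taken from the MPT corpus).
    static List<Text> sampleCorpus() {
        return Arrays.asList(
                new Text("mpt", "le", "mariage", "pour", "tous"),
                new Text("mpt", "le", "texte", "pour"),
                new Text("other", "le", "budget"));
    }

    // Keeps only the texts whose 'debat' property equals the given value
    // (the sub-corpus), then counts how often each word form occurs in them.
    static Map<String, Integer> lexicon(List<Text> corpus, String debatValue) {
        Map<String, Integer> frequencies = new TreeMap<>();
        for (Text text : corpus) {
            if (!debatValue.equals(text.debat)) {
                continue; // outside the sub-corpus
            }
            for (String word : text.words) {
                frequencies.merge(word, 1, Integer::sum);
            }
        }
        return frequencies;
    }

    // Distinct word forms ("items") versus total tokens ("occurrences").
    static String summary(Map<String, Integer> lexicon) {
        int occurrences = 0;
        for (int count : lexicon.values()) {
            occurrences += count;
        }
        return lexicon.size() + " items for " + occurrences + " occurrences.";
    }

    public static void main(String[] args) {
        Map<String, Integer> lex = lexicon(sampleCorpus(), "mpt");
        System.out.println(summary(lex)); // prints "5 items for 7 occurrences."
    }
}
```

If the sub-corpus step silently selected nothing, the same summary would report 0 items for 0 occurrences, which is one way to tell an empty selection from a failing command.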

Conclusion 1

Impossible to reproduce the problem.

Diagnostic 2

Test with source version:

  • File / Import / XML/w+CSV MPT -> new 'MPTSRC' corpus
  • Sub-corpus structure=metadata, property=debat, value=mpt -> new sub-corpus
  • Lexicon on sub-corpus works -> '24730 items pour 946286 occurrences.' (24730 items for 946286 occurrences)

Conclusion 2

Impossible to reproduce the problem.

Diagnostic 3

Test with binary version on Windows 7 64-bit:

The Load command should abort with the following error (in French) [from AD]:

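Échec de l'extraction du corpus binaire : java.io.FileNotFoundException: ...\TXM\corpora\mpt\data\MPT\metadata_debat.avs (L’opération demandée n’a pu s’accomplir sur un fichier ayant une section mappée utilisateur ouverte)

English: Failed to extract the binary corpus: java.io.FileNotFoundException: ...\TXM\corpora\mpt\data\MPT\metadata_debat.avs (The requested operation cannot be performed on a file with a user-mapped section open)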
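The Windows message says that the target file still has a memory mapping open. One possible reading, not confirmed in this ticket, is that an earlier 'mpt' corpus was still mapped by a running process when the binary corpus was extracted over it. The sketch below reproduces that general situation in plain Java: it maps a file read-only and then tries to overwrite it. The class name, helper methods, and temporary file are hypothetical and unrelated to TXM's code.

```java
import java.io.FileOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.nio.MappedByteBuffer;
import java.nio.channels.FileChannel;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardOpenOption;

class MappedFileOverwriteDemo {

    // Creates a temporary file standing in for an index file such as metadata_debat.avs.
    static Path newSampleFile(String content) {
        try {
            Path file = Files.createTempFile("metadata_debat", ".avs");
            Files.write(file, content.getBytes(StandardCharsets.UTF_8));
            return file;
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Maps the whole file read-only. The mapping stays valid after the
    // channel is closed, until the buffer is garbage-collected.
    static MappedByteBuffer mapReadOnly(Path file) {
        try (FileChannel channel = FileChannel.open(file, StandardOpenOption.READ)) {
            return channel.map(FileChannel.MapMode.READ_ONLY, 0, channel.size());
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    // Truncates and rewrites the file. Returns null on success,
    // or the exception message if the file could not be opened or written.
    static String tryOverwrite(Path file, String content) {
        try (FileOutputStream out = new FileOutputStream(file.toFile())) {
            out.write(content.getBytes(StandardCharsets.UTF_8));
            return null;
        } catch (IOException e) {
            return e.getMessage();
        }
    }

    public static void main(String[] args) {
        Path file = newSampleFile("mpt\n");
        MappedByteBuffer mapped = mapReadOnly(file);

        String failure = tryOverwrite(file, "new\n");
        System.out.println(failure == null
                ? "overwrite succeeded"
                : "overwrite failed: " + failure);

        // Referencing the buffer here keeps the mapping alive during the overwrite attempt.
        System.out.println("mapped bytes: " + mapped.capacity());
    }
}
```

The outcome is platform-dependent: Windows refuses to truncate a file that has an active mapping, and Java reports this as a FileNotFoundException carrying the operating-system message, whereas Linux and macOS normally allow the overwrite. That difference would be consistent with the problem showing up only in the Windows 7 test.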

Conclusion 3

History

#1 Updated by Serge Heiden about 5 years ago

  • Description updated (diff)

#2 Updated by Serge Heiden about 5 years ago

  • Description updated (diff)

#3 Updated by Serge Heiden about 5 years ago

  • Subject changed from TXM 0.7.6 Corpus Mariage pour tous: Import or Sub-corpus to KF: Corpus Mariage pour tous: Import or Sub-corpus
