Support #873

Under Windows, BFM import module fails on corpora composed of densely XML tagged texts

Added by Matthieu Decorde about 5 years ago. Updated about 5 years ago.

Status: New
Start date: 06/18/2014
Priority: Normal
Due date:
Assignee: -
% Done: 0%
Category: Import
Spent time: -
Target version: Known bugs

Description

Under Windows, when the XML sources are too dense, the import module can crash.

Origin: Under Windows, the 'cwb-encode' indexing tool cannot handle XML sources with too many tags, attributes, or nesting levels (too many open files).

Solution:
  1. reduce the number of tags or attributes, or the nesting depth, of the XML sources;
  2. or, import the corpus from another operating system.
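
To judge how much a source would need to be reduced for solution 1, it can help to measure its density first. The following is a minimal sketch, not part of TXM or CWB: the function name `xml_density` and the sample document are hypothetical, and the exact threshold at which 'cwb-encode' fails is not known here, so the figures are only useful for comparing sources with one another.

```python
import io
import xml.etree.ElementTree as ET


def xml_density(source):
    """Count distinct tags, distinct (tag, attribute) pairs and the maximum
    nesting depth of an XML source (file path or binary file object)."""
    tags = set()
    attributes = set()
    depth = 0
    max_depth = 0
    for event, elem in ET.iterparse(source, events=("start", "end")):
        if event == "start":
            depth += 1
            max_depth = max(max_depth, depth)
            tags.add(elem.tag)
            # Attributes are already available on the "start" event.
            for name in elem.attrib:
                attributes.add((elem.tag, name))
        else:
            depth -= 1
            elem.clear()  # release content of finished elements (large files)
    return {"tags": len(tags), "attributes": len(attributes), "max_depth": max_depth}


if __name__ == "__main__":
    # Hypothetical sample: 4 distinct tags, 3 tag/attribute pairs, depth 4.
    sample = io.BytesIO(b'<text id="t1"><div n="1"><p><w pos="N">mot</w></p></div></text>')
    print(xml_density(sample))
```

Running the function on each source file of a corpus shows which texts carry the most distinct tags and attributes or the deepest nesting, and are therefore the first candidates for simplification.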

The problem has been submitted to the CWB project:
https://sourceforge.net/p/cwb/bugs/59

History

#1 Updated by Serge Heiden about 5 years ago

  • Subject changed from Windows BFM import fails with 'big' corpus to Under Windows, BFM import module fails on corpora composed of densely XML tagged texts

#2 Updated by Matthieu Decorde about 5 years ago

During the CQP corpus index building process in the import modules, too many files are open at once and cwb-encode (the component that writes the CQP indexes) stops.

#3 Updated by Serge Heiden about 5 years ago

  • Description updated (diff)

#4 Updated by Serge Heiden about 5 years ago

  • Description updated (diff)
