Bug #528
Mis à jour par Alexey Lavrentev il y a plus de 11 ans
Some xml tags from the source document appear as words in lexical indexes, e.g. <pre></?ab.*></pre> "</?ab.*>" in Schiller corpus (check source documents and binary corpus at /SpUV/Schiller).
The same sources were correctly imported with TXM 0.7.2 with the same parameters...
In the BVHEPISTEMON2014 corpus, such misinterpreted tags are very numerous.
The same sources were correctly imported with TXM 0.7.2 with the same parameters...
In the BVHEPISTEMON2014 corpus, such misinterpreted tags are very numerous.