Bug #528

Mis à jour par Alexey Lavrentev il y a plus de 11 ans

Some xml tags from the source document appear as words in lexical indexes, e.g. <pre></?ab.*></pre> "</?ab.*>" in Schiller corpus (check source documents and binary corpus at /SpUV/Schiller).

The same sources were correctly imported with TXM 0.7.2 with the same parameters...

In the BVHEPISTEMON2014 corpus, such misinterpreted tags are very numerous.

Retour