Bug #2821
Import, broken generated word id
Status: | New | Start date: | 05/15/2020 | ||
---|---|---|---|---|---|
Priority: | Normal | Due date: | |||
Assignee: | - | % Done: | 80% |
||
Category: | Import | Spent time: | - | ||
Target version: | TXM 0.8.1 |
Description
Associated revisions
fix word id generation refs #2821
History
#1 Updated by Matthieu Decorde about 3 years ago
- Description updated (diff)
#2 Updated by Matthieu Decorde about 3 years ago
- Description updated (diff)
#3 Updated by Matthieu Decorde almost 3 years ago
- % Done changed from 0 to 80
to be tested in next setup/update
#4 Updated by Alexey Lavrentev almost 3 years ago
Test works fine as formulated in https://groupes.renater.fr/wiki/txm-users/public/retours_de_bugs_logiciel/txm_0.8.1beta#retours.
But several problems persist:- the letters in existing Id are converted to lower case;
- no check on duplicate id is run. To test:
- Create a t1.xml and paste the following content:
<text id="T1"> <w id="w_recup_0">début</w> du texte. <w id="11">les</w> <w id="w_t1_2">mots</w> <w id="w_T1_3">suivant</w> <w id="4">du</w> <w id="5">textes</w> <w id="7">.</w> <w id="w_recup_8">fin</w> <w id="w_recup_9">.</w> </text>
- Use XML/W+CSV import module
- Make a lexicon of the "id" word property
- You will get:
w_t1_2 2 w_t1_3 2 w_11 1 w_4 1 w_5 1 w_7 1 w_recup_0 1 w_recup_8 1 w_recup_9 1 w_sans_titre1_4 1
#5 Updated by Matthieu Decorde almost 3 years ago
- % Done changed from 80 to 60
need to fix existing ID with min&maj characters