Bug #1251: RCP: 0.7.6, fix tokenizer parameters fields in import form editor - Plateforme TXM - Forge du Centre Blaise Pascal

Bug #1251

RCP: 0.7.6, fix tokenizer parameters fields in import form editor

Ajouté par Serge Heiden il y a plus de 10 ans. Mis à jour il y a plus de 4 ans.

Statut:

Closed

Début:

04/02/2015

Priorité:

Normal

Echéance:

Assigné à:

-

% réalisé:

100%

Catégorie:

Import

Temps passé:

-

Version cible:

Description

the word space separators field default value '[\p{Z}|\p{C}]+' is incorrect -> fix it to '[\p{Z}\p{C}]+'
- MD: OK fixed in 0.7.7
the end of sentence characters field default value '[\p{Ps}|\p{Pe}|\p{Pi}|\p{Pf}|\p{Po}|\p{S}]' is incorrect -> fix it to '[\p{Ps}\p{Pe}\p{Pi}\p{Pf}\p{Po}\p{S}]'
- MD: OK fixed in 0.7.7
when any tokenizer parameter field is edited, the import module is always run with the "MISSING TOKENIZER KEY: punct_strong" message, which means that orthographic sentences are never analyzed?
- MD: OK fixed in 0.7.7, the "punct_strong" was not well updated causing this message display
what is the purpose of the 'OK' button? -> suggestion: remove the 'OK' button
- MD: OK removed the "OK" button in 0.7.7

see ticket #1347 for more tokenizer import fixes

Historique

#1 Mis à jour par Serge Heiden il y a plus de 10 ans

Description mis à jour (diff)

#2 Mis à jour par Serge Heiden il y a plus de 10 ans

Description mis à jour (diff)

#3 Mis à jour par Matthieu Decorde il y a plus de 10 ans

Description mis à jour (diff)
Statut changé de New à In Progress
% réalisé changé de 0 à 30

#4 Mis à jour par Matthieu Decorde il y a plus de 10 ans

Description mis à jour (diff)
% réalisé changé de 30 à 40

#5 Mis à jour par Matthieu Decorde il y a plus de 10 ans

Description mis à jour (diff)
% réalisé changé de 40 à 80

#6 Mis à jour par Matthieu Decorde il y a plus de 4 ans

Statut changé de In Progress à Closed
% réalisé changé de 80 à 100

Formats disponibles : Atom PDF