Feature #1214: RCP: x.x, detect the missing texts in the metadata.csv file before running import - Plateforme TXM - Forge du Centre Blaise Pascal

Feature #1214

RCP: x.x, detect the missing texts in the metadata.csv file before running import

Ajouté par Matthieu Decorde il y a presque 11 ans. Mis à jour il y a presque 11 ans.

Statut:

New

Début:

23/12/2014

Priorité:

Normal

Echéance:

Assigné à:

% réalisé:

Catégorie:

Import

Temps passé:

Version cible:

TXM 0.X.X

Description

Help the user to find out metadata.csv file errors before running the import

Solution¶

The metadata tab could highlight wrong lines.
A wrong line is a line which:

text source file has not been found.
a value is empty
has a column not well formatted
...

Demandes liées

Historique

#1 Mis à jour par Serge Heiden il y a presque 11 ans

I don't think that the solution is to move some work at another moment (before the import).

The solution first consists in giving a precise and exhaustive diagnostic message during the import process:

if a text file name is not found in 'metadata.csv', the following warning message should be given on one line "Warning: impossible to find a metadata line for the text of name 'X Y' (filename is 'X Y.xml')." [the quotes '...' in the message should help to diagnoze number of spaces problems]
if the fact that a metadata value is empty is a problem or should be diagnozed, this should be a processing option. For example: EmptyMetadataCellProcessing with values 'Ignore' (take appropriate action so that TXM is not broken but the corpus imported), 'Warning' (print warning message + take appropriate action so that TXM is not broken but the corpus imported - default value) or 'Error' (print error message - metadata name, text name, metadata.csv line number + stop the import).
if a metadata.csv line is diagnozed as unreadable and this could be considered as an error, this should be a processing option. For example: UnreadableMetadataLineProcessing with values 'Warning' (print warning message + take appropriate action so that TXM is not broken but the corpus imported - default value) or 'Error' (print error message - text name, metadata.csv line number + stop the import).

If we want later to design an independant UI from the import process to help diagnoze metadata.csv files, we should relate that to a metadata editor component (including table import and table edition capabilities) or a properties editor component (with the distinction text properties (metadata) / structures properties / lexical properties).

Formats disponibles : Atom PDF

Laboratoire ICAR » Plateforme TXM

Demandes

Rapports personnalisés

Feature #1214

RCP: x.x, detect the missing texts in the metadata.csv file before running import

Solution¶

Historique

#1 Mis à jour par Serge Heiden il y a presque 11 ans