Feature #1641

RCP: X.X, Synoptic import module

Ajouté par Serge Heiden il y a plus de 9 ans. Mis à jour il y a plus de 6 ans.

Statut:New Début:21/01/2016
Priorité:Normal Echéance:
Assigné à:- % réalisé:

0%

Catégorie:Import Temps passé: -
Version cible:TXM 0.X.X

Description

A lot of projects need to compare raw OCR results or transcriptions with original page images.

XTZ import module helps a lot to easily build synoptic editions, but it is also rather complex with all the various features it provides.

The idea is to help people use TXM to build and use synoptic editions, and possibly to use more tools like concordances to explore OCR errors efficiently and maybe discover more TXM tools in the end.

Solution

Build a simplified, and reduced, import module UI (based on the XTZ import module) that selects only a source directory as input and builds a corpus with synoptic editions.

The necessary source directory structure with XML-TEI files and images files or URLs is documented extensively to prepare the sources correctly before calling the import module.

The import module (XTZ) should provide progressive, systematic, extensive and comprehensive diagnostic messages while importing the corpus to help debug the corpus sources.

Such a corpus could be opened in a new simplified, and reduced, Edition Perspective.

Historique

#1 Mis à jour par Sebastien Jacquot il y a plus de 7 ans

  • Version cible changé de TXM 0.8.0a (split/restructuration) à TXM 0.8.0

#2 Mis à jour par Matthieu Decorde il y a plus de 6 ans

  • Version cible changé de TXM 0.8.0 à TXM 0.X.X

Formats disponibles : Atom PDF