Feature #200

export, corpus : "light" vs "complete" export option

Ajouté par Benedicte Pincemin il y a plus de 4 ans. Mis à jour il y a environ 4 ans.

Statut:New Début:04/07/2013
Priorité:High Echéance:
Assigné à:- % réalisé:

0%

Catégorie:- Temps passé: -
Version cible:TXM 0.8

Description

Have the possibility to reduce the size of the binary corpus to the minimum necessary size (for example, by deleting some directories like interp, ptokenized, tokenized, treetagger, wtc). This parameter could be set through the parameters interface (tools/parameters/txm/user/export). The default value could be "light" ? (the advantage of choosing the "complete" option should be explaned, anyway).

Historique

#1 Mis à jour par Alexey Lavrentev il y a plus de 4 ans

When choosing the "light" option, the user should be asked if (s)he wants to reduce the binary corpus on his/her system or just for the export. This may help save a lot of disk space.
Probably, a new function "Optimize binary corpora" should be created in the Settings menu as well

#2 Mis à jour par Matthieu Decorde il y a environ 4 ans

  • Priorité changé de Normal à High

#3 Mis à jour par Serge Heiden il y a environ 4 ans

This ticket interfers with the need to keep (or not) intermediary data files during an import process, for import debug or multiple-import operations. Which should be another ticket related to Import.

Otherwise, for the current ticket. Here is a prefered Scenario: the option should not be designed to help to downsize a binary corpus exported, but instead to augment a standard binary corpus (called "light" in the description) with various secondary informations.

Formats disponibles : Atom PDF