Feature #200

RCP: X.X, export corpus options

Added by Benedicte Pincemin over 6 years ago. Updated 19 days ago.

Status: New
Start date: 07/04/2013
Priority: High
Due date:
Assignee: -
% Done:
Category: Corpus
Spent time: -
Target version: TXM X.X


Make it possible to reduce the binary corpus to the minimum necessary size (for example, by deleting directories such as interp, ptokenized, tokenized, treetagger, wtc). This parameter could be set through the parameters interface (tools/parameters/txm/user/export). The default value could perhaps be "light"; in any case, the advantage of choosing the "complete" option should be explained.
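To make the "light" vs "complete" distinction concrete, here is a minimal sketch in Python (not TXM code). It assumes that the directories named above are top-level subdirectories of the binary corpus folder and that an export is a plain zip archive; the function name `export_corpus` and the `light` flag are hypothetical.

```python
import zipfile
from pathlib import Path

# Directory names come from the ticket description. Treating them as
# top-level subdirectories of the corpus folder is an assumption.
LIGHT_EXCLUDED_DIRS = {"interp", "ptokenized", "tokenized", "treetagger", "wtc"}


def export_corpus(corpus_dir, archive_path, light=True):
    """Zip corpus_dir into archive_path and return the archived relative paths.

    With light=True, files under the excluded directories are skipped.
    The archive is expected to be written outside corpus_dir.
    """
    corpus_dir = Path(corpus_dir)
    archived = []
    with zipfile.ZipFile(archive_path, "w", zipfile.ZIP_DEFLATED) as zf:
        for path in sorted(corpus_dir.rglob("*")):
            if not path.is_file():
                continue
            rel = path.relative_to(corpus_dir)
            # rel.parts[0] is the top-level directory (or the file itself).
            if light and rel.parts[0] in LIGHT_EXCLUDED_DIRS:
                continue
            zf.write(path, arcname=rel.as_posix())
            archived.append(rel.as_posix())
    return archived
```

The source corpus is left untouched in this sketch; only the exported archive is reduced.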


Show what will be exported, and add options to select what is exported:
  • results (selected)
  • cqp indexes (selected)
  • txm files (selected)
  • editions (selected)
  • temporary and extra files, in a list (unselected)
    • add SELECT ALL / UNSELECT ALL buttons
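The option list above could be modelled as follows. This is a sketch in Python under stated assumptions, not the TXM implementation: the class name `ExportOptions` and its methods are hypothetical, and only the defaults (four items selected, extra files unselected) come from the ticket.

```python
from dataclasses import dataclass, field


@dataclass
class ExportOptions:
    # Defaults follow the ticket: these four items are selected.
    results: bool = True
    cqp_indexes: bool = True
    txm_files: bool = True
    editions: bool = True
    # Temporary and extra files, keyed by name; unselected by default.
    extra_files: dict = field(default_factory=dict)

    def add_extra(self, name):
        """Register an extra file or directory, unselected unless already set."""
        self.extra_files.setdefault(name, False)

    def set_all_extras(self, selected):
        """Back the SELECT ALL (True) and UNSELECT ALL (False) buttons."""
        for name in self.extra_files:
            self.extra_files[name] = selected

    def selected(self):
        """Return the labels of everything that would be exported."""
        core = [
            ("results", self.results),
            ("cqp indexes", self.cqp_indexes),
            ("txm files", self.txm_files),
            ("editions", self.editions),
        ]
        chosen = [label for label, on in core if on]
        chosen += sorted(n for n, on in self.extra_files.items() if on)
        return chosen
```

A dialog could call `selected()` to preview what will be exported before the user confirms.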

Related issues

related to Feature #2712: RCP: X.X, export corpus options New 07/04/2013


#1 Updated by Alexey Lavrentev over 6 years ago

When choosing the "light" option, the user should be asked whether (s)he wants to reduce the binary corpus on his/her system or just for the export. This may help save a lot of disk space.
Probably, a new function "Optimize binary corpora" should be created in the Settings menu as well.
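Before asking the user, such a function could report how much disk space would actually be freed. The following Python sketch assumes the removable directories are the ones listed in the ticket description and sit directly under the corpus folder; `reclaimable_bytes` is a hypothetical name.

```python
from pathlib import Path

# Directory names from the ticket description; their location directly
# under the corpus folder is an assumption.
REMOVABLE_DIRS = ("interp", "ptokenized", "tokenized", "treetagger", "wtc")


def reclaimable_bytes(corpus_dir, removable=REMOVABLE_DIRS):
    """Sum the sizes of all files under the removable subdirectories."""
    corpus_dir = Path(corpus_dir)
    total = 0
    for name in removable:
        sub = corpus_dir / name
        if sub.is_dir():
            total += sum(p.stat().st_size for p in sub.rglob("*") if p.is_file())
    return total
```

Nothing is deleted here; the result is only an estimate to show in the confirmation prompt.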

#2 Updated by Matthieu Decorde over 6 years ago

  • Priority changed from Normal to High

#3 Updated by Serge Heiden over 6 years ago

This ticket interferes with the need to keep (or not) intermediate data files during an import process, for import debugging or multiple-import operations. That need should be covered by a separate ticket related to Import.

Otherwise, for the current ticket, here is a preferred scenario: the option should not be designed to help downsize an exported binary corpus, but instead to augment a standard binary corpus (called "light" in the description) with various secondary information.

#4 Updated by Matthieu Decorde about 2 years ago

  • Target version changed from 5 to TXM 0.8.0a (split/restructuration)

#5 Updated by Sebastien Jacquot over 1 year ago

  • Target version changed from TXM 0.8.0a (split/restructuration) to TXM 0.8.0

#6 Updated by Matthieu Decorde about 1 year ago

  • Target version changed from TXM 0.8.0 to TXM 0.8.2

#7 Updated by Matthieu Decorde 3 months ago

  • Subject changed from export, corpus : "light" vs "complete" export option to RCP: X.X, export corpus options
  • Description updated (diff)
  • Category set to Corpus

#8 Updated by Matthieu Decorde 19 days ago

  • Description updated (diff)
  • Target version changed from TXM 0.8.2 to TXM X.X
