Feature #3356

import, tokenizer step, display total number of tokens and texts at the end of the step

Added by Serge Heiden 2 months ago.

Status:New Start date:03/27/2023
Priority:Normal Due date:
Assignee:- % Done:

0%

Category:Import Spent time: -
Target version:TXM 0.8.4

Description

To help diagnose certain volumetric problems when importing a corpus, it can be interesting to provide an order of magnitude of the corpus.

Even if an import stops at a later stage, the word count is a good indicator of volume.

Also available in: Atom PDF