Feature #2009

TBX: X.X, add line number to concordance default reference for TXT based import modules

Added by Serge Heiden over 6 years ago. Updated over 4 years ago.

Status: New
Start date: 01/21/2017
Priority: Normal
Due date:
Assignee: -
% Done: 0%
Category: Import
Spent time: -
Target version: TXM X.X

Description

Currently, the default concordance reference for TXT-based import modules (TXT+CSV, Clipboard...) is the text id.
Raw texts have line numbers, which are useful to display: they show where a concordance pivot occurs in a text (at the end, in the middle, before another pivot, etc.) and help to get back to the sources (for example, for text correction or text extraction).

Solution

In the TXT-based import modules, build a default reference w@ref of the form "text@id, w@lbn", where w@lbn is the number of the line on which the word occurs.
For example:

leviathan, 12
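
A minimal sketch of how such a reference could be derived from a raw text, written in Python for illustration only (it is not TXM import code). The function name, the whitespace tokenization, and the 1-based line numbering are assumptions made for this example.

```python
def references_with_line_numbers(text_id, raw_text):
    """Yield (word, reference) pairs, where reference is "text_id, line number".

    Hypothetical helper: words are split on whitespace and lines are
    numbered from 1, both assumptions made for this sketch.
    """
    for lbn, line in enumerate(raw_text.splitlines(), start=1):
        for word in line.split():
            yield word, f"{text_id}, {lbn}"


sample = "first line of text\nsecond line\nthird"
refs = list(references_with_line_numbers("leviathan", sample))
```

Every word on the same line gets the same reference, so a concordance sorted by reference follows the order of the source lines.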

Solution 2

We could also build a new synthetic metadata text@lbn whose value is the total number of lines in the text,
and provide a default reference like "text@id, w@lbn/text@lbn".
For example:

leviathan, 12/23000
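
The idea can be sketched as follows, in Python for illustration only. The helper names are hypothetical, and counting lines with `splitlines()` is an assumption about what the total number of lines means (a trailing newline does not add an empty last line).

```python
def count_lines(raw_text):
    """Total number of lines in a raw text (candidate value for text@lbn)."""
    return len(raw_text.splitlines())


def reference_with_total(text_id, lbn, total_lines):
    """Build a reference of the form "text_id, lbn/total_lines"."""
    return f"{text_id}, {lbn}/{total_lines}"


example = reference_with_total("leviathan", 12, 23000)
```

The total only needs to be computed once per text at import time; each word then reuses it.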

Solution 3

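We could also build a new synthetic internal metadata text@lbnl whose value is the number of digits in the total number of lines: (Math.log10(text@lbn) + 1) as int, i.e. the integer part of log10(text@lbn), plus 1.
The default reference would then left-pad w@lbn with spaces to that width, like "text@id, <.. filling spaces ...>w@lbn/text@lbn", so that line numbers align in the reference column.
For example: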

leviathan,    12/23000
leviathan,  1312/23000
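
A minimal sketch of the padding, in Python for illustration only. The function names are hypothetical. The digit count is taken from the decimal string length, which equals floor(log10(n)) + 1 for n >= 1 and avoids floating-point rounding; this is a substitute for the Math.log10 formula in the description, not the same computation.

```python
def digit_width(total_lines):
    """Number of decimal digits in total_lines (candidate value for text@lbnl)."""
    return len(str(total_lines))


def padded_reference(text_id, lbn, total_lines):
    """Right-align lbn in a field as wide as total_lines has digits."""
    width = digit_width(total_lines)
    return f"{text_id}, {lbn:>{width}}/{total_lines}"


short = padded_reference("leviathan", 12, 23000)
longer = padded_reference("leviathan", 1312, 23000)
```

With a total of 23000 lines the field is 5 characters wide, so 12 is preceded by three padding spaces and 1312 by one, as in the two example references above.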

History

#1 Updated by Serge Heiden over 6 years ago

  • Description updated

#2 Updated by Sebastien Jacquot about 5 years ago

  • Target version changed from TXM 0.8.0a (split/restructuration) to TXM 0.8.0

#3 Updated by Matthieu Decorde over 4 years ago

  • Target version changed from TXM 0.8.0 to TXM X.X
