Bug #2736

RCP: 0.8.0, Back-to link broken for some binary 0.7.9 corpora loaded into TXM 0.8.0

Added by Alexey Lavrentev 7 months ago. Updated 7 months ago.

Status:New Start date:01/22/2020
Priority:Normal Due date:
Assignee:- % Done:

0%

Category:UI / Interaction Spent time: -
Target version:TXM 0.8.2

Description

TXM version: 0.8.0.202001161703 org.txm.rcp.app null
OS: Ubuntu 16.04

To reproduce the bug:

Load ORTOFON2ANNOT3 binary corpus from https://sharedocs.huma-num.fr/wl/?id=vjA7XIdgjw5YCpVVJ1QJIAxQNYmDpTld (created with TXM 0.7.9 XTZ module)!
Make a concordance of any word (e.g. "voda")
Double-click on a concordance line.

Diagnostics

The problem is due to the editions generated with XSLT. The txm:first-word-id meta is set to "w_0" for the first (title) page of every text. TXM 0.7.X first retrieved the text, then the page, so for 1-page editions the right page was displayed. TXM 0.8.0 looks directly for word identifiers.

Solution

Step 1

Change the XSL producing the editions to set the first-word-id to w_textId_0 for the first page.

Step 2

When loading a 0.7.9 corpus check the first-word-id of the first page and insert text identifier if necessary

Step 3

Work out a more robust back-to-text mechanism.

History

#1 Updated by Alexey Lavrentev 7 months ago

  • Description updated (diff)
  • Target version changed from TXM 0.8.1 to TXM 0.8.2

#2 Updated by Alexey Lavrentev 7 months ago

  • Description updated (diff)

Also available in: Atom PDF