Bug #1549

RCP: 0.7.7, Alceste import bugs

Added by Matthieu Decorde about 8 years ago. Updated over 4 years ago.

Status:New Start date:10/02/2015
Priority:Normal Due date:
Assignee:- % Done:

0%

Category:Import Spent time: -
Target version:TXM X.X

Description

A) Accents in text property names

With UTF-8 character encoding and an ISO-latin-1 source file, accents in property names break the import.

B) Digit prefix in the begining of text lines (starry lines format)

  • '001 ' begining of lines are not recognized as starry line
001 *loc_dg *année_1969 *ver

Solution

Use regex + 1 property minimum pattern to decide if it is a starry line: "[0-9]+ \*[a-z]+"

C) Print a better error message in case of starry lines detection problem

Solution

If no starry lines (no texts) are detected in an input file, the error message must be:

** Alceste import: no text delimitation lines detected in file "%s" (suggestion: verify the syntax of your text delimitation lines - starry lines - '**** *prop_val' or '01 *prop_val')

History

#1 Updated by Serge Heiden about 8 years ago

  • Subject changed from RCP: X.X, Alceste import bugs to RCP: 0.7.7, Alceste import bugs
  • Description updated (diff)

#2 Updated by Serge Heiden about 8 years ago

  • Description updated (diff)

#3 Updated by Sebastien Jacquot over 5 years ago

  • Target version changed from TXM 0.8.0a (split/restructuration) to TXM 0.8.0

#4 Updated by Serge Heiden over 4 years ago

  • Description updated (diff)

#5 Updated by Serge Heiden over 4 years ago

  • Description updated (diff)

#6 Updated by Matthieu Decorde over 4 years ago

  • Target version changed from TXM 0.8.0 to TXM X.X

Also available in: Atom PDF