Bug #1549
RCP: 0.7.7, Alceste import bugs
Status: | New | Start date: | 10/02/2015 |
---|---|---|---|
Priority: | Normal | Due date: | |
Assignee: | - | % Done: | 0% |
Category: | Import | Spent time: | - |
Target version: | TXM X.X | | |
Description
A) Accents in text property names
With UTF-8 as the character encoding and an ISO-Latin-1 source file, accents in property names break the import.
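The following Python sketch is not TXM code. It only illustrates one plausible cause, under the assumption that the file's ISO-Latin-1 bytes end up being decoded as UTF-8: the accented letter in a property name such as `année` is then either rejected or replaced. The variable names are hypothetical.

```python
# Illustration only, not TXM code. Assumption: the ISO-Latin-1 bytes of the
# source file are decoded as UTF-8 somewhere in the import chain.
line = "001 *loc_dg *année_1969 *ver"
raw = line.encode("iso-8859-1")  # bytes as stored in an ISO-Latin-1 file

# Strict UTF-8 decoding rejects the lone 0xE9 byte that encodes 'é'.
try:
    raw.decode("utf-8")
    strict_ok = True
except UnicodeDecodeError:
    strict_ok = False

# Lenient decoding keeps going but replaces the accent with U+FFFD.
lossy = raw.decode("utf-8", errors="replace")

# Decoding with the file's real charset restores the property name.
correct = raw.decode("iso-8859-1")
```

In this sketch `correct` equals the original line, while `lossy` no longer contains the property name `année_1969`.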
B) Digit prefix at the beginning of text lines (starry line format)
- Lines beginning with '001 ' are not recognized as starry lines
001 *loc_dg *année_1969 *ver
Solution
Use a regular expression that requires a digit prefix followed by at least one property to decide whether a line is a starry line: "[0-9]+ \*[a-z]+"
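A minimal Python sketch of this test, for illustration only (TXM's import code is not shown in this ticket). The digit-prefix pattern is the one proposed above; the `****` pattern and the function name `is_starry_line` are assumptions, the former based on the classic '**** *prop_val' syntax.

```python
import re

# Pattern proposed in this ticket: digit prefix, one space, one '*property'.
DIGIT_STARRY = re.compile(r"[0-9]+ \*[a-z]+")
# Assumption: the classic form starts with four asterisks instead of digits.
ASTERISK_STARRY = re.compile(r"\*\*\*\* \*[a-z]+")

def is_starry_line(line: str) -> bool:
    """Return True if the line starts like a text delimitation line."""
    # match() anchors at the start of the line, so a digit sequence in the
    # middle of ordinary text does not qualify.
    return bool(DIGIT_STARRY.match(line) or ASTERISK_STARRY.match(line))
```

Because only the first property has to match, accents in later properties (as in `*année_1969`) do not affect detection with this pattern.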
C) Print a better error message in case of starry lines detection problem
Solution
If no starry lines (no texts) are detected in an input file, the error message must be:
** Alceste import: no text delimitation lines detected in file "%s" (suggestion: verify the syntax of your text delimitation lines - starry lines - '**** *prop_val' or '01 *prop_val')
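As a sketch of how such a check could be wired in, the Python below raises the message above when no starry line is found. It is not TXM code: `check_starry_lines`, the combined pattern and the use of `ValueError` are hypothetical choices made for illustration.

```python
import re

# Assumed combined pattern: '****' or a digit prefix, then one '*property'.
STARRY = re.compile(r"(?:\*\*\*\*|[0-9]+) \*[a-z]+")

# Message text taken from this ticket; %s receives the input file name.
NO_TEXT_MESSAGE = (
    '** Alceste import: no text delimitation lines detected in file "%s" '
    "(suggestion: verify the syntax of your text delimitation lines "
    "- starry lines - '**** *prop_val' or '01 *prop_val')"
)

def check_starry_lines(file_name, lines):
    """Return the number of starry lines, or raise if there are none."""
    count = sum(1 for line in lines if STARRY.match(line))
    if count == 0:
        # Hypothetical error type; a real importer may log and abort instead.
        raise ValueError(NO_TEXT_MESSAGE % file_name)
    return count
```

A caller would pass the file name and its decoded lines, for example `check_starry_lines("corpus.txt", open_lines)`, where `open_lines` is whatever list of strings the importer has already read.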
History
#1 Updated by Serge Heiden about 8 years ago
- Subject changed from RCP: X.X, Alceste import bugs to RCP: 0.7.7, Alceste import bugs
- Description updated (diff)
#2 Updated by Serge Heiden about 8 years ago
- Description updated (diff)
#3 Updated by Sebastien Jacquot over 5 years ago
- Target version changed from TXM 0.8.0a (split/restructuration) to TXM 0.8.0
#4 Updated by Serge Heiden over 4 years ago
- Description updated (diff)
#5 Updated by Serge Heiden over 4 years ago
- Description updated (diff)
#6 Updated by Matthieu Decorde over 4 years ago
- Target version changed from TXM 0.8.0 to TXM X.X