Feature #1983
RCP: X.X, add PDF+CSV import module
| Status: | New | Start date: | 01/06/2017 |
|---|---|---|---|
| Priority: | Normal | Due date: | |
| Assignee: | - | % Done: | 0% |
| Category: | Import | Spent time: | - |
| Target version: | TXM X.X | | |
Description
Currently, TXM doesn't provide any import module for the PDF format, for the reasons explained here.
We must help users import sources in PDF format because:
- it is very widely used
- even if the PDF format is not easy to manage, some PDF representations work well
- some libraries do a decent job with respect to some PDF representations
Solution
- use the PDF Java library used by GROBID (a tool used by a lot of document management platforms)
- document the fact that PDF import is not perfect
History
#1 Updated by Serge Heiden over 6 years ago
- Description updated
#2 Updated by Sebastien Jacquot about 5 years ago
- Target version changed from TXM 0.8.0a (split/restructuration) to TXM 0.8.0
#3 Updated by Matthieu Decorde over 4 years ago
- Target version changed from TXM 0.8.0 to TXM 0.8.2
#4 Updated by Matthieu Decorde almost 4 years ago
- Target version changed from TXM 0.8.2 to TXM X.X