Feature #1983

RCP: X.X, add PDF+CSV import module

Added by Serge Heiden over 2 years ago. Updated about 1 month ago.

Status:New Start date:01/06/2017
Priority:Normal Due date:
Assignee:- % Done:

0%

Category:Import Spent time: -
Target version:TXM 0.8.1

Description

Currently TXM doesn't provide any PDF format related import module for the reasons explained here.

We must help to import sources in PDF format because:
  • it is massively used
  • even if the PDF format is not easy to manage, some PDF representations work well
  • some libraries do decent job with respect to some PDF representations

Solution

  • use the PDF Java library used by GROBID (a tool used by a lot of document management platforms)
  • document the fact that PDF import is not perfect

History

#1 Updated by Serge Heiden over 2 years ago

  • Description updated (diff)

#2 Updated by Sebastien Jacquot 10 months ago

  • Target version changed from TXM 0.8.0a (split/restructuration) to TXM 0.8.0

#3 Updated by Matthieu Decorde about 1 month ago

  • Target version changed from TXM 0.8.0 to TXM 0.8.1

Also available in: Atom PDF