Feature #2341

RCP: X.X, fill the "Main language" input parameter values menu from the list of available language model files

Added by Serge Heiden over 1 year ago. Updated 3 months ago.

Status: New
Start date: 02/09/2018
Priority: Normal
Due date:
Assignee: -
% Done: 0%
Category: Import
Spent time: -
Target version: TXM X.X

Description

Solution spec

  • create a general NLP interface/component
  • add tokenizer, tagger, lemmatizer, parser, etc. sub-interfaces to the NLP component
  • add a setProcessor method to the tagger interface
  • add a language interface to the tagger interface
  • add an addLanguage method to the tagger interface
  • add a getAvailableLanguages method to the language interface
  • add a new NLP preferences UI
  • add a setTaggerProcessor UI in the NLP preferences UI
  • add a tagger preferences UI (depends on the processor)
  • Solution 1
    • populate an availableLanguages field of the tagger interface by calling the addLanguage method for each language model file found in the processor parameters directory
  • Solution 2
    • add an Add Language UI in the tagger preferences UI
    • populate an availableLanguages field of the tagger interface by calling the addLanguage method from the Add Language UI
  • etc.
  • build the "Main language" input parameter values list by calling the getAvailableLanguages method
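
The spec above could be sketched in Java roughly as follows. This is only an illustration, not an agreed design: setProcessor, addLanguage, getAvailableLanguages and availableLanguages come from the spec, while NlpSketch, LanguageSupport, SimpleTagger, addLanguagesFrom, the "TreeTagger" processor name and the ".par" extension are hypothetical names and values chosen for the example. It also assumes that a language is identified by the model file name without its extension.

```java
import java.io.File;
import java.util.ArrayList;
import java.util.List;
import java.util.Set;
import java.util.TreeSet;

public class NlpSketch {

    /** Language interface: exposes the languages a component can handle. */
    interface LanguageSupport {
        List<String> getAvailableLanguages();
    }

    /** Tagger sub-interface of the NLP component. */
    interface Tagger extends LanguageSupport {
        void setProcessor(String processorName);
        void addLanguage(String language);
    }

    /** Hypothetical implementation backing the availableLanguages field. */
    static class SimpleTagger implements Tagger {
        private String processor;
        // Sorted set: no duplicates and a stable order for the menu.
        private final Set<String> availableLanguages = new TreeSet<>();

        @Override
        public void setProcessor(String processorName) {
            this.processor = processorName;
        }

        @Override
        public void addLanguage(String language) {
            availableLanguages.add(language);
        }

        @Override
        public List<String> getAvailableLanguages() {
            return new ArrayList<>(availableLanguages);
        }

        /**
         * Solution 1: register one language per model file found in the
         * processor parameters directory. The language name is assumed to be
         * the file name without the given extension.
         */
        void addLanguagesFrom(File parametersDir, String extension) {
            File[] files = parametersDir.listFiles();
            if (files == null) {
                return; // not a directory, or not readable
            }
            for (File file : files) {
                String name = file.getName();
                if (file.isFile() && name.endsWith(extension)
                        && name.length() > extension.length()) {
                    addLanguage(name.substring(0, name.length() - extension.length()));
                }
            }
        }
    }

    public static void main(String[] args) {
        SimpleTagger tagger = new SimpleTagger();
        tagger.setProcessor("TreeTagger"); // assumed processor name
        if (args.length > 0) {
            // Solution 1: scan the directory given on the command line.
            tagger.addLanguagesFrom(new File(args[0]), ".par"); // assumed extension
        }
        // Solution 2: a language added by hand, as the Add Language UI would do.
        tagger.addLanguage("fr");
        // These values would feed the "Main language" menu.
        System.out.println(tagger.getAvailableLanguages());
    }
}
```

With this shape, Solution 1 and Solution 2 both go through addLanguage, so the "Main language" menu only needs to call getAvailableLanguages and does not depend on how the languages were registered.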

Solution implementation

Check available NLP interfaces before building a new one, e.g. (sorted by interest) DKPro, Stanford NLP, OpenNLP, LingPipe, Lucene and Solr, GATE, UIMA, MALLET, Icarus

Check available NLP UIs before building a new one, e.g. WebLicht, TextFlows

History

#1 Updated by Sebastien Jacquot 12 months ago

  • Target version changed from TXM 0.8.0a (split/restructuration) to TXM 0.8.0

#2 Updated by Matthieu Decorde 3 months ago

  • Target version changed from TXM 0.8.0 to TXM X.X
