Task #1800
RCP: Dictionary: commands to build Palafra frolex-2.0.tsv file
Status: | New | Start date: | 06/08/2016 | |
---|---|---|---|---|
Priority: | Normal | Due date: | ||
Assignee: | - | % Done: | 0% |
|
Category: | Commands | Spent time: | - | |
Target version: | TXM Palafra 3.0 |
Description
see https://groupes.renater.fr/wiki/palafra/public/palafra_lemmatisation_fro/lemmatisation_spec_v2
Validation test¶
- Install Dictionary extension, call File > Install a third party extension
- work with (update site) = http://textometrie.ens-lyon.fr/dist/palafra
- Create 'dmflex' from dmflex.tsv file, call Dictionary > Import:
- name = dmflex
- tsvfile = path to dmflex.tsv
- Convert "category" to "msd_cattex_conv" with "conv_dmf_ctx9-dmf.tsv" conversion table, call Dictionary > Recode Column:
- name = dmflex
- conversionFile = path to "conv_dmf_ctx9-dmf.tsv"
- type (source) = category
- newType (target) = msd_cattex_conv
- mode = abandon
- Remove duplicate entries (unmatched sources), call Dictionary > RemoveEntries:
- name = dmflex
- type = source
- pattern = [A-Z]?[0-9].*|\?|HAddenda.*|ipocon.*|K\(Add.*|liane.*|viorne|x540b|xxx
- Convert "source" to "source" with "dmf-src_frolex-src.tsv" conversion table, call Dictionary > Recode Column:
- name = dmflex
- conversionFile = path to "dmf-src_frolex-src.tsv"
- type (source) = source
- newType (target) = source
- mode = abandon
- Build a hierarchical index of "source" column, call Dictionary > UniqSort:
- name = dmflex
- tsvFile = path to TSV result file
- col = source
- Export the whole dmflex dictionary, call Dictionary > Export:
- name = dmflex
- tsvFile = path to TSV result file
History
#1 Updated by Serge Heiden over 4 years ago
- Subject changed from RCP: Dictionary: commands necesary to build Palafra frolex-2.0.tsv file to RCP: Dictionary: commands to build Palafra frolex-2.0.tsv file
#2 Updated by Matthieu Decorde over 4 years ago
- Description updated (diff)
#3 Updated by Matthieu Decorde over 4 years ago
- Description updated (diff)
#4 Updated by Alexey Lavrentev over 4 years ago
- Description updated (diff)
#5 Updated by Alexey Lavrentev over 4 years ago
- Description updated (diff)
#6 Updated by Alexey Lavrentev over 4 years ago
- Description updated (diff)
#7 Updated by Alexey Lavrentev over 4 years ago
- Description updated (diff)