Feature #1571

RCP: X.X, add one word property to Specificity score and display

Added by Serge Heiden over 3 years ago. Updated over 3 years ago.

Status:New Start date:20/10/2015
Priority:Normal Due date:
Assignee:- % Done:

0%

Category:Commands Spent time: -
Target version:TXM X.X

Description

It would be useful to break down the specificity calculation by a secondary word property.

For example, one could break down the specificity of lemmas by their part of speech (pos).

We could display only a summary view of the most specific lemmas, those above a certain threshold (such as the banality threshold):

frlemma/frpos  NOM.*                 ADJ.*                         ...
chirac         solidarité, emploi,   nouveau, fidèle, public,      ...
               avenir, projet,       républicain, sûr              ...
               mondialisation                                      ...
dg             coopération,          scientifique, économique,     ...
               développement, but,   français, algérien, fécond    ...
               peuple, rapport                                     ...
giscard        vœu, bonheur,         simple, français,             ...
               unité, liberté,       intelligent, difficile, actif ...
               ami                                                 ...

Solution

  • For each secondary property value (or set of values), build a specificity table from a partition index and sort it by score:
    • Specificity(VOEUX/text@loc, frpos="NOM.*"/frlemma)
    • Specificity(VOEUX/text@loc, frpos="ADJ.*"/frlemma)
    • ...
  • Merge the specificity tables: put the most specific units in the table cells, with the secondary property values (or sets of values) as columns and the original partition values as rows, or the other way round.
  • Add a new 'Filter specificity scores' parameter, with 'yes' as the default value.
  • When FilterSpecificityScores is true, don't display units under the banality threshold in the results table.
  • A further 'Maximum number of units to display in results' parameter could be added to also limit the number of units shown. This parameter should be discussed together with the Vmax parameter of Lexicon and Index.
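
The merge and filter steps above could be sketched as follows. This is only an illustration in Python, not TXM code: the function name merge_specificity_tables, its parameter names, and the default banality value of 2.0 are assumptions, and the scores in example_tables are invented for the example rather than computed from the VOEUX corpus. The sketch also assumes that only positive scores at or above the threshold are kept.

```python
def merge_specificity_tables(tables, banality=2.0, filter_scores=True, max_units=None):
    """Merge per-column specificity tables into one synthetic table.

    tables: {column_label: {(part, unit): score}}, one entry per secondary
            property value or set of values (e.g. 'NOM.*', 'ADJ.*').
    Returns {part: {column_label: [units sorted by decreasing score]}}.
    All names and defaults here are hypothetical.
    """
    cells = {}
    for column, scores in tables.items():
        for (part, unit), score in scores.items():
            row = cells.setdefault(part, {})  # keep the part even if all its units are filtered out
            if filter_scores and score < banality:
                continue  # unit is under the banality threshold
            row.setdefault(column, []).append((score, unit))

    merged = {}
    for part, row in cells.items():
        merged[part] = {}
        for column in tables:
            # highest score first, ties broken alphabetically
            ranked = sorted(row.get(column, []), key=lambda su: (-su[0], su[1]))
            # max_units=None keeps every remaining unit
            merged[part][column] = [unit for _, unit in ranked][:max_units]
    return merged


# Invented scores, for illustration only.
example_tables = {
    "NOM.*": {
        ("chirac", "solidarité"): 8.1,
        ("chirac", "emploi"): 5.3,
        ("chirac", "an"): 0.4,
        ("giscard", "vœu"): 6.2,
        ("giscard", "bonheur"): 4.0,
    },
    "ADJ.*": {
        ("chirac", "nouveau"): 3.9,
        ("giscard", "simple"): 7.5,
        ("giscard", "grand"): 1.1,
    },
}

synthetic = merge_specificity_tables(example_tables)
```

Here the partition values are the rows and the secondary property patterns are the columns; swapping the two nesting levels would give the transposed layout mentioned above.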
See also:

History

#1 Updated by Serge Heiden over 3 years ago

  • Description updated (diff)
