Feature #1570

Updated by Serge Heiden almost 5 years ago

h3. A. More synthetic display - words & score - whole lexicon

Current specificity results are displayed in the following columns (VOEUX/text@loc:word):
<pre>
Unit F f loc1 sp loc1 f loc2 sp loc2 ...
. 2461 848 17,2 325 -24,3 ...
Une 35 26 8,2 4 -1,0 ...
La 135 65 7,0 8 -6,0 ...
aussi 117 56 6,1 13 -2,4 ...
emploi 60 34 6,0 3 -3,3 ...
</pre>

To explore specific words for two different parts,
the user needs to successively sort each 'score' column
of the parts considered to get access to the most/less
specific words.

An initial specificity results display could be more
synthetic by providing immediately the *most specific
words* for each part. Here is a possible display:
<pre>
u. loc1 sp loc1 u. loc2 sp loc2 ...
. 17,2 ; 14,4 ...
Une 8,2 , 11,2 ...
La 7,0 qu' 10,6 ...
aussi 6,1 en 10,2 ...
emploi 6,0 Algérie 9,3 ...
</pre>

With this display, the user begins to read specificity results globaly at first: which units are specific to each part.

Means could be provided to navigate to the other classical view of specificity results for more detailed information.

h3. A. Solution

This kind of Specificity results display has already been developed in the TXM portal version. Though we don't know its specifications nore its objectives.

h3. B. More synthetic display - score only - index lexicon

To be even more synthetic, one can only display the *specificity scores above a certain threshold* (like banality). Here is a possible display:
<pre>
unit sp loc1 sp loc2 sp loc3 ...
peuple -4,7 4,2 ...
emploi 7,6 -5,3 ...
avenir 5,9 ...
liberté 5,8 ...
confiance ...
</pre>

h3. B. Solution

* add a new 'Filter specificity scores' parameter, with 'yes' as default value
* when FilterSpecificityScores is true, don't display specificity score above the banality threshold in the results table
* ** a further 'Maximum number of units to display in results' parameter can be created to filter also by number of units. Parameter which should be discussed together with the Vmax parameter of Lexicon and Index.

See also:
* Le vocabulaire des confédérations syndicales ouvrières : une analyse des spécificités, Alain Bergounioux, Michel F. Launay, Josette Lefềvre, René Mouriaux, Jean-Pierre Sueur, Mots Year 1981 Volume 2 Issue 1 pp. 139-156, http://www.persee.fr/doc/mots_0243-6450_1981_num_2_1_1025
* Le vocabulaire spécifique des motions Mitterrand, Rocard et CERES au congrès de Metz (1979), Simone Bonnafous, Mots Year 1981 Volume 3 Issue 1 pp. 79-94, http://www.persee.fr/doc/mots_0243-6450_1981_num_3_1_1040

Back