Bug #2413

Sending edition text selection to concordance doesn't work if the selection contains punctuation marks

Added by Sebastien Jacquot about 1 year ago. Updated 6 months ago.

Status:New Start date:06/15/2018
Priority:Normal Due date:
Assignee:- % Done:

0%

Category:UI / Link / Command Spent time: -
Target version:TXM 0.8.1

Description

Sending edition text selection to concordance doesn't work if the selection contains punctuation.

There's also a problem with "spaces" that can lead to this kind of query {"donc" ""} that does nothing in concordance.

May need to fix "more" the query to extract the punctuation chars.

New location after split:
/org.txm.edition.rcp/src/org/txm/edition/rcp/handlers/SendEditionTo.java

Solution

SLH: edition text characters selection should really be edition text words selection, that is the internal CQP tokens matching the surface 'browser text characters selection' representing the CQP tokens word forms.

So it is more a matter of matching and selecting the correct word tokens under the surface than cleaning selected characters.

See the URS Annotation extension code to see how to select the token IDs under the browser characters selection, and how to extend the character selection to the characters corresponding to the real graphical form boundaries of the words.

Solution 2

  1. define the wanted behavior for multiple words/token selection
    1. sequential or not sequential query? -> SLH: always sequential. If the characters sequence denotes a sequence of words, the query must search for a sequence of words.
  2. fix the query according to the above choice, eg. [word = "domaines"][word = "politiques"] or [word = "domaines"]|[word = "politique"]

NOTE: sequential queries behavior is different according to current CQP matching strategy, by default concordance doesn't build the pivot with all te sequence

History

#1 Updated by Sebastien Jacquot about 1 year ago

  • Description updated (diff)

#2 Updated by Sebastien Jacquot 12 months ago

  • Target version changed from TXM 0.8.0a (split/restructuration) to TXM 0.8.0

#3 Updated by Sebastien Jacquot 6 months ago

  • Description updated (diff)
  • Target version changed from TXM 0.8.0 to TXM 0.8.1

#4 Updated by Serge Heiden 6 months ago

  • Description updated (diff)

Also available in: Atom PDF