root / Portal / configurations / html / Home_srcmf.jsp @ 3
Historique | Voir | Annoter | Télécharger (6,99 ko)
1 | 3 | alavrent | <jsp:directive.page contentType="text/html;charset=UTF-8" /> |
---|---|---|---|
2 | 3 | alavrent | <!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> |
3 | 3 | alavrent | <html xmlns="http://www.w3.org/1999/xhtml"> |
4 | 3 | alavrent | <head> |
5 | 3 | alavrent | <meta http-equiv="Content-Type" content="text/html; charset=UTF-8" /> |
6 | 3 | alavrent | <title></title> |
7 | 3 | alavrent | </head> |
8 | 3 | alavrent | |
9 | 3 | alavrent | <body> |
10 | 3 | alavrent | <div style="padding-left:25px;"> |
11 | 3 | alavrent | <h1>Welcome to the SRCMF demo</h1> |
12 | 3 | alavrent | <p> |
13 | 3 | alavrent | In this portal you will be able to query the syntactic annotations of the SRCMF sample corpus (Beroul) with the TigerSearch search engine. |
14 | 3 | alavrent | </p> |
15 | 3 | alavrent | |
16 | 3 | alavrent | <h1> Use of TXM portal interface </h1> |
17 | 3 | alavrent | |
18 | 3 | alavrent | <p> |
19 | 3 | alavrent | To begin a TigerSearch query: |
20 | 3 | alavrent | <ul> |
21 | 3 | alavrent | <li> right click on the "SRCMF2" corpus icon in the left panel.</li> |
22 | 3 | alavrent | <li> select the "Tiger Search" menu item</li> |
23 | 3 | alavrent | <li> A new search tab will open, and allow you to type TigerSearch queries</li> |
24 | 3 | alavrent | <li> The "Search" button will run the query</li> |
25 | 3 | alavrent | <li> The first matching tree will be displayed in the panel below</li> |
26 | 3 | alavrent | </ul> |
27 | 3 | alavrent | </p> |
28 | 3 | alavrent | |
29 | 3 | alavrent | <p> |
30 | 3 | alavrent | The central bar then allows you to navigate through the results. |
31 | 3 | alavrent | <ul> |
32 | 3 | alavrent | <li>the "|<" button shows the first match</li> |
33 | 3 | alavrent | <li>the "<" button shows the previous match</li> |
34 | 3 | alavrent | <li>the ">" button shows the next match</li> |
35 | 3 | alavrent | <li>the ">|" button show the last match</li> |
36 | 3 | alavrent | <li>the "Export" button opens a new window with the current tree, that you can save in a file</li> |
37 | 3 | alavrent | </ul> |
38 | 3 | alavrent | </p> |
39 | 3 | alavrent | |
40 | 3 | alavrent | <p> |
41 | 3 | alavrent | In the left panel, all the usual TXM commands are also available on the "SRCMF2" corpus (by right clicking on the corpus): |
42 | 3 | alavrent | <ul> |
43 | 3 | alavrent | <li> Dimensions: gives the number of tokens and the number of types of the corpus</li> |
44 | 3 | alavrent | <li> Index: gives the frequency list of results of CQL patterns</li> |
45 | 3 | alavrent | <li> Lexicon: gives the frequency list of word properties</li> |
46 | 3 | alavrent | <li> Concordances: builds a KWIC concordance of the results of a CQL query</li> |
47 | 3 | alavrent | <li> Tiger Search: displays the matching trees of a TIGERSearch query</li> |
48 | 3 | alavrent | <li> Context: builds a concordance of the results of a CQL query</li> |
49 | 3 | alavrent | <li> Edition: displays the corpus text</li> |
50 | 3 | alavrent | </ul> |
51 | 3 | alavrent | </p> |
52 | 3 | alavrent | |
53 | 3 | alavrent | <h1>SRCMF corpus: TIGERSearch web interface</h1> |
54 | 3 | alavrent | |
55 | 3 | alavrent | <h2>Query format</h2> |
56 | 3 | alavrent | |
57 | 3 | alavrent | <p>Type query using standard TIGERSearch syntax in the format <tt>#node:[ properties ] operator #node:[ properties ]</tt>. |
58 | 3 | alavrent | Full documentation for the query language may be found <a href = "http://www.ims.uni-stuttgart.de/projekte/TIGER/TIGERSearch/doc/html/QueryLanguage.html">here</a>; |
59 | 3 | alavrent | see also the <a href = "http://www.ims.uni-stuttgart.de/projekte/TIGER/TIGERSearch/doc/html/QueryLanguage_QuickReference.html">quick reference</a>. |
60 | 3 | alavrent | </p> |
61 | 3 | alavrent | |
62 | 3 | alavrent | <h2>Model semantics</h2> |
63 | 3 | alavrent | |
64 | 3 | alavrent | <h3>Non-terminal nodes</h3> |
65 | 3 | alavrent | |
66 | 3 | alavrent | <p>The corpus contains two main types of non-terminal node: <strong>heads</strong> and <strong>structures</strong>. |
67 | 3 | alavrent | These alternate in the hierarchy: <tt>structure >M head >D structure(s)... >M head</tt>. |
68 | 3 | alavrent | |
69 | 3 | alavrent | <p>Each non-terminal node has a <tt>cat</tt> property.</p> |
70 | 3 | alavrent | The <tt>cat</tt> property of a head gives its morphological type: |
71 | 3 | alavrent | <span title="Finite verb"><tt>VFin</tt></span>, |
72 | 3 | alavrent | <span title="Infinitive"><tt>VInf</tt></span>, |
73 | 3 | alavrent | <span title="Participle"><tt>VPar</tt></span>, |
74 | 3 | alavrent | <span title="Noun"><tt>Nom</tt></span>, |
75 | 3 | alavrent | <span title="Adjective"><tt>Adj</tt></span>, |
76 | 3 | alavrent | <span title="Adverb"><tt>Adv</tt></span>. |
77 | 3 | alavrent | <p>The <tt>cat</tt> property of a structure node gives its function relative to the head on which it depends: |
78 | 3 | alavrent | <span title="Auxiliated non-finite verb (neither passive nor active)"><tt>Aux</tt></span>, |
79 | 3 | alavrent | <span title="Auxiliated non-finite verb (active)"><tt>AuxA</tt></span>, |
80 | 3 | alavrent | <span title="Auxiliated non-finite verb (passive)"><tt>AuxP</tt></span>, |
81 | 3 | alavrent | <span title="Determiner"><tt>Det</tt></span>, |
82 | 3 | alavrent | <span title="Modifier"><tt>ModA</tt></span>, |
83 | 3 | alavrent | <span title="Dislocated modifier"><tt>ModD</tt></span>, |
84 | 3 | alavrent | <span title="Object attribute"><tt>AtObj</tt></span>, |
85 | 3 | alavrent | <span title="Subject attribute"><tt>AtSj</tt></span>, |
86 | 3 | alavrent | <span title="Non-finite clause"><tt>Regim</tt></span>, |
87 | 3 | alavrent | <span title="Complement"><tt>Cmpl</tt></span>, |
88 | 3 | alavrent | <span title="Object"><tt>Obj</tt></span>, |
89 | 3 | alavrent | <span title="Reflexive"><tt>Refl</tt></span>, |
90 | 3 | alavrent | <span title="Impersonal / expletive subject"><tt>SjImp</tt></span>, |
91 | 3 | alavrent | <span title="Personal subject"><tt>SjPer</tt></span>, |
92 | 3 | alavrent | <span title="Adjunct"><tt>Circ</tt></span>, |
93 | 3 | alavrent | <span title="Negative particle (e.g. pas, mie)"><tt>NgPrt</tt></span>, |
94 | 3 | alavrent | <span title="Negative"><tt>Ng</tt></span>, |
95 | 3 | alavrent | <span title="Apostrophe (vocative)"><tt>Apst</tt></span>, |
96 | 3 | alavrent | <span title="Inserted clause"><tt>Insrt</tt></span>, |
97 | 3 | alavrent | <span title="Interjection"><tt>Intj</tt></span>.</p> |
98 | 3 | alavrent | |
99 | 3 | alavrent | <p>The head of each sentence (a finite verb) is the daughter of a <strong>maximal structure</strong> (<tt>cat = <span title="Sentence">Snt</span></tt>).</p> |
100 | 3 | alavrent | |
101 | 3 | alavrent | <p>Relating words such a prepositions, conjunctions and subordinated conjunctions (<span title="Non-coordinating relator"><tt>RelNC</tt></span>, |
102 | 3 | alavrent | <span title="Coordinating relator"><tt>RelC</tt></span>) are daughters of a structure node: structure >R relator.</p> |
103 | 3 | alavrent | |
104 | 3 | alavrent | <p>Coordinated structures are linked to a <strong>group</strong> node by a secondary arc, labelled <tt>coord</tt>: group >~coord structure.</p> |
105 | 3 | alavrent | |
106 | 3 | alavrent | <h3>Terminal nodes</h3> |
107 | 3 | alavrent | |
108 | 3 | alavrent | <p><strong>Words</strong> are terminal nodes. Each terminal node has a <tt>word</tt> property, containing its lexical form, and a <tt>pos</tt> property, containing its part-of-speech (Cattex–TreeTagger).</p> |
109 | 3 | alavrent | |
110 | 3 | alavrent | <h2>Sample queries</h2> |
111 | 3 | alavrent | <p> |
112 | 3 | alavrent | The following sample queries may be tested by copying and pasting into the query panel. |
113 | 3 | alavrent | </p> |
114 | 3 | alavrent | <p>Find all main clause verbs:<br/> |
115 | 3 | alavrent | <tt>[cat = "Snt"] >M #vfin:[cat = "VFin"]</tt></p> |
116 | 3 | alavrent | |
117 | 3 | alavrent | <p>Find all structures introduced by a preposition:<br/> |
118 | 3 | alavrent | <tt>#n >R #relnc:[cat = "RelNC"]<br/> |
119 | 3 | alavrent | & #relnc >L [pos = /PRE.*/]</tt><br/> |
120 | 3 | alavrent | </p> |
121 | 3 | alavrent | |
122 | 3 | alavrent | <p>Find all post-verbal NP subjects:<br/> |
123 | 3 | alavrent | <tt>#verb:[cat = "VFin"] >D #suj:[cat = "SjPer"]<br/> |
124 | 3 | alavrent | & #suj >M [cat = "Nom"]<br/> |
125 | 3 | alavrent | & #suj >@l #sword<br/> |
126 | 3 | alavrent | & #verb >L #vword<br/> |
127 | 3 | alavrent | & #vword .* #sword</tt></p> |
128 | 3 | alavrent | |
129 | 3 | alavrent | <p>Find indefinite subjects introduced by <q>qui</q>:<br/> |
130 | 3 | alavrent | <tt>[cat = "VFin"] >D #suj:[cat = "SjPer"]<br/> |
131 | 3 | alavrent | & #suj >R #relnc:[cat = "RelNC"]<br/> |
132 | 3 | alavrent | & ( #relnc >L [word = /[QqKk]u?i/]<br/> |
133 | 3 | alavrent | | #relnc >~dupl [word = /[QqKk]u?i/] )</tt><br/></p> |
134 | 3 | alavrent | |
135 | 3 | alavrent | <p>Find sentences with coordinated subjects:<br/> |
136 | 3 | alavrent | <tt>#coo:[cat = "Coo"] >~coord #sj1:[cat = "SjPer"]<br/> |
137 | 3 | alavrent | & #coo >~coord #sj2:[cat = "SjPer"]<br/> |
138 | 3 | alavrent | & #sj1 $ #sj2</tt></p> |
139 | 3 | alavrent | |
140 | 3 | alavrent | <p>Find sentences with possible <q>gapping</q> of the finite verb (i.e. coordination of subject–predicate pairs):<br/> |
141 | 3 | alavrent | <tt>#gpcoo1:[cat = "GpCoo"] >~coord #suj1:[cat = "SjPer"]<br/> |
142 | 3 | alavrent | & #gpcoo1 $.* #gpcoo2:[cat = "GpCoo"]<br/> |
143 | 3 | alavrent | & #gpcoo2 >~coord #suj2:[cat = "SjPer"]<br/> |
144 | 3 | alavrent | & #gpcoo1 >~coord #pred1:[cat = /Cmpl|Obj|AtSj/]<br/> |
145 | 3 | alavrent | & #gpcoo2 >~coord #pred2:[cat = /Cmpl|Obj|AtSj/]<br/> |
146 | 3 | alavrent | </p> |
147 | 3 | alavrent | |
148 | 3 | alavrent | <h1> Useful links</h1> |
149 | 3 | alavrent | <p> |
150 | 3 | alavrent | <ul> |
151 | 3 | alavrent | <li><a href = "https://listes.cru.fr/wiki/srcmf/index">SRCMF wiki</a></li> |
152 | 3 | alavrent | <li><a href = "https://textometrie.svn.sourceforge.net/svnroot/textometrie/trunk/doc/Tutoriel%20TXM%20pour%20la%20BFM%20V0.9_Web.odt">TXM tutorial (FR)</a> </li> |
153 | 3 | alavrent | <li><a href = "http://www.ims.uni-stuttgart.de/projekte/TIGER/TIGERSearch/oldindex.shtml">TIGERSearch project website</a></li> |
154 | 3 | alavrent | <li><a href = "http://bfm.ens-lyon.fr/">BFM project website</a></li> |
155 | 3 | alavrent | <li><a href = "http://textometrie.ens-lyon.fr/?lang=en">TXM project website</a></li> |
156 | 3 | alavrent | </ul> |
157 | 3 | alavrent | </p> |
158 | 3 | alavrent | </div> |
159 | 3 | alavrent | </body> |
160 | 3 | alavrent | </html> |