root / Portal / configurations / html / Home_srcmf.jsp @ 3
Historique | Voir | Annoter | Télécharger (6,99 ko)
1 |
<jsp:directive.page contentType="text/html;charset=UTF-8" /> |
---|---|
2 |
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd"> |
3 |
<html xmlns="http://www.w3.org/1999/xhtml"> |
4 |
<head> |
5 |
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8" /> |
6 |
<title></title> |
7 |
</head> |
8 |
|
9 |
<body> |
10 |
<div style="padding-left:25px;"> |
11 |
<h1>Welcome to the SRCMF demo</h1> |
12 |
<p> |
13 |
In this portal you will be able to query the syntactic annotations of the SRCMF sample corpus (Beroul) with the TigerSearch search engine. |
14 |
</p> |
15 |
|
16 |
<h1> Use of TXM portal interface </h1> |
17 |
|
18 |
<p> |
19 |
To begin a TigerSearch query: |
20 |
<ul> |
21 |
<li> right click on the "SRCMF2" corpus icon in the left panel.</li> |
22 |
<li> select the "Tiger Search" menu item</li> |
23 |
<li> A new search tab will open, and allow you to type TigerSearch queries</li> |
24 |
<li> The "Search" button will run the query</li> |
25 |
<li> The first matching tree will be displayed in the panel below</li> |
26 |
</ul> |
27 |
</p> |
28 |
|
29 |
<p> |
30 |
The central bar then allows you to navigate through the results. |
31 |
<ul> |
32 |
<li>the "|<" button shows the first match</li> |
33 |
<li>the "<" button shows the previous match</li> |
34 |
<li>the ">" button shows the next match</li> |
35 |
<li>the ">|" button show the last match</li> |
36 |
<li>the "Export" button opens a new window with the current tree, that you can save in a file</li> |
37 |
</ul> |
38 |
</p> |
39 |
|
40 |
<p> |
41 |
In the left panel, all the usual TXM commands are also available on the "SRCMF2" corpus (by right clicking on the corpus): |
42 |
<ul> |
43 |
<li> Dimensions: gives the number of tokens and the number of types of the corpus</li> |
44 |
<li> Index: gives the frequency list of results of CQL patterns</li> |
45 |
<li> Lexicon: gives the frequency list of word properties</li> |
46 |
<li> Concordances: builds a KWIC concordance of the results of a CQL query</li> |
47 |
<li> Tiger Search: displays the matching trees of a TIGERSearch query</li> |
48 |
<li> Context: builds a concordance of the results of a CQL query</li> |
49 |
<li> Edition: displays the corpus text</li> |
50 |
</ul> |
51 |
</p> |
52 |
|
53 |
<h1>SRCMF corpus: TIGERSearch web interface</h1> |
54 |
|
55 |
<h2>Query format</h2> |
56 |
|
57 |
<p>Type query using standard TIGERSearch syntax in the format <tt>#node:[ properties ] operator #node:[ properties ]</tt>. |
58 |
Full documentation for the query language may be found <a href = "http://www.ims.uni-stuttgart.de/projekte/TIGER/TIGERSearch/doc/html/QueryLanguage.html">here</a>; |
59 |
see also the <a href = "http://www.ims.uni-stuttgart.de/projekte/TIGER/TIGERSearch/doc/html/QueryLanguage_QuickReference.html">quick reference</a>. |
60 |
</p> |
61 |
|
62 |
<h2>Model semantics</h2> |
63 |
|
64 |
<h3>Non-terminal nodes</h3> |
65 |
|
66 |
<p>The corpus contains two main types of non-terminal node: <strong>heads</strong> and <strong>structures</strong>. |
67 |
These alternate in the hierarchy: <tt>structure >M head >D structure(s)... >M head</tt>. |
68 |
|
69 |
<p>Each non-terminal node has a <tt>cat</tt> property.</p> |
70 |
The <tt>cat</tt> property of a head gives its morphological type: |
71 |
<span title="Finite verb"><tt>VFin</tt></span>, |
72 |
<span title="Infinitive"><tt>VInf</tt></span>, |
73 |
<span title="Participle"><tt>VPar</tt></span>, |
74 |
<span title="Noun"><tt>Nom</tt></span>, |
75 |
<span title="Adjective"><tt>Adj</tt></span>, |
76 |
<span title="Adverb"><tt>Adv</tt></span>. |
77 |
<p>The <tt>cat</tt> property of a structure node gives its function relative to the head on which it depends: |
78 |
<span title="Auxiliated non-finite verb (neither passive nor active)"><tt>Aux</tt></span>, |
79 |
<span title="Auxiliated non-finite verb (active)"><tt>AuxA</tt></span>, |
80 |
<span title="Auxiliated non-finite verb (passive)"><tt>AuxP</tt></span>, |
81 |
<span title="Determiner"><tt>Det</tt></span>, |
82 |
<span title="Modifier"><tt>ModA</tt></span>, |
83 |
<span title="Dislocated modifier"><tt>ModD</tt></span>, |
84 |
<span title="Object attribute"><tt>AtObj</tt></span>, |
85 |
<span title="Subject attribute"><tt>AtSj</tt></span>, |
86 |
<span title="Non-finite clause"><tt>Regim</tt></span>, |
87 |
<span title="Complement"><tt>Cmpl</tt></span>, |
88 |
<span title="Object"><tt>Obj</tt></span>, |
89 |
<span title="Reflexive"><tt>Refl</tt></span>, |
90 |
<span title="Impersonal / expletive subject"><tt>SjImp</tt></span>, |
91 |
<span title="Personal subject"><tt>SjPer</tt></span>, |
92 |
<span title="Adjunct"><tt>Circ</tt></span>, |
93 |
<span title="Negative particle (e.g. pas, mie)"><tt>NgPrt</tt></span>, |
94 |
<span title="Negative"><tt>Ng</tt></span>, |
95 |
<span title="Apostrophe (vocative)"><tt>Apst</tt></span>, |
96 |
<span title="Inserted clause"><tt>Insrt</tt></span>, |
97 |
<span title="Interjection"><tt>Intj</tt></span>.</p> |
98 |
|
99 |
<p>The head of each sentence (a finite verb) is the daughter of a <strong>maximal structure</strong> (<tt>cat = <span title="Sentence">Snt</span></tt>).</p> |
100 |
|
101 |
<p>Relating words such a prepositions, conjunctions and subordinated conjunctions (<span title="Non-coordinating relator"><tt>RelNC</tt></span>, |
102 |
<span title="Coordinating relator"><tt>RelC</tt></span>) are daughters of a structure node: structure >R relator.</p> |
103 |
|
104 |
<p>Coordinated structures are linked to a <strong>group</strong> node by a secondary arc, labelled <tt>coord</tt>: group >~coord structure.</p> |
105 |
|
106 |
<h3>Terminal nodes</h3> |
107 |
|
108 |
<p><strong>Words</strong> are terminal nodes. Each terminal node has a <tt>word</tt> property, containing its lexical form, and a <tt>pos</tt> property, containing its part-of-speech (Cattex–TreeTagger).</p> |
109 |
|
110 |
<h2>Sample queries</h2> |
111 |
<p> |
112 |
The following sample queries may be tested by copying and pasting into the query panel. |
113 |
</p> |
114 |
<p>Find all main clause verbs:<br/> |
115 |
<tt>[cat = "Snt"] >M #vfin:[cat = "VFin"]</tt></p> |
116 |
|
117 |
<p>Find all structures introduced by a preposition:<br/> |
118 |
<tt>#n >R #relnc:[cat = "RelNC"]<br/> |
119 |
& #relnc >L [pos = /PRE.*/]</tt><br/> |
120 |
</p> |
121 |
|
122 |
<p>Find all post-verbal NP subjects:<br/> |
123 |
<tt>#verb:[cat = "VFin"] >D #suj:[cat = "SjPer"]<br/> |
124 |
& #suj >M [cat = "Nom"]<br/> |
125 |
& #suj >@l #sword<br/> |
126 |
& #verb >L #vword<br/> |
127 |
& #vword .* #sword</tt></p> |
128 |
|
129 |
<p>Find indefinite subjects introduced by <q>qui</q>:<br/> |
130 |
<tt>[cat = "VFin"] >D #suj:[cat = "SjPer"]<br/> |
131 |
& #suj >R #relnc:[cat = "RelNC"]<br/> |
132 |
& ( #relnc >L [word = /[QqKk]u?i/]<br/> |
133 |
| #relnc >~dupl [word = /[QqKk]u?i/] )</tt><br/></p> |
134 |
|
135 |
<p>Find sentences with coordinated subjects:<br/> |
136 |
<tt>#coo:[cat = "Coo"] >~coord #sj1:[cat = "SjPer"]<br/> |
137 |
& #coo >~coord #sj2:[cat = "SjPer"]<br/> |
138 |
& #sj1 $ #sj2</tt></p> |
139 |
|
140 |
<p>Find sentences with possible <q>gapping</q> of the finite verb (i.e. coordination of subject–predicate pairs):<br/> |
141 |
<tt>#gpcoo1:[cat = "GpCoo"] >~coord #suj1:[cat = "SjPer"]<br/> |
142 |
& #gpcoo1 $.* #gpcoo2:[cat = "GpCoo"]<br/> |
143 |
& #gpcoo2 >~coord #suj2:[cat = "SjPer"]<br/> |
144 |
& #gpcoo1 >~coord #pred1:[cat = /Cmpl|Obj|AtSj/]<br/> |
145 |
& #gpcoo2 >~coord #pred2:[cat = /Cmpl|Obj|AtSj/]<br/> |
146 |
</p> |
147 |
|
148 |
<h1> Useful links</h1> |
149 |
<p> |
150 |
<ul> |
151 |
<li><a href = "https://listes.cru.fr/wiki/srcmf/index">SRCMF wiki</a></li> |
152 |
<li><a href = "https://textometrie.svn.sourceforge.net/svnroot/textometrie/trunk/doc/Tutoriel%20TXM%20pour%20la%20BFM%20V0.9_Web.odt">TXM tutorial (FR)</a> </li> |
153 |
<li><a href = "http://www.ims.uni-stuttgart.de/projekte/TIGER/TIGERSearch/oldindex.shtml">TIGERSearch project website</a></li> |
154 |
<li><a href = "http://bfm.ens-lyon.fr/">BFM project website</a></li> |
155 |
<li><a href = "http://textometrie.ens-lyon.fr/?lang=en">TXM project website</a></li> |
156 |
</ul> |
157 |
</p> |
158 |
</div> |
159 |
</body> |
160 |
</html> |