Statistics
| Revision:

root / Portal / configurations / html / Home_srcmf.jsp @ 3

History | View | Annotate | Download (7 kB)

1
<jsp:directive.page contentType="text/html;charset=UTF-8" />
2
<!DOCTYPE html PUBLIC "-//W3C//DTD XHTML 1.0 Transitional//EN" "http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd">
3
<html xmlns="http://www.w3.org/1999/xhtml">
4
<head>
5
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8" />
6
<title></title>
7
</head>
8

    
9
<body>
10
<div style="padding-left:25px;">
11
<h1>Welcome to the SRCMF demo</h1>
12
<p>
13
In this portal you will be able to query the syntactic annotations of the SRCMF sample corpus (Beroul) with the TigerSearch search engine.
14
</p>
15

    
16
<h1> Use of TXM portal interface </h1>
17

    
18
<p>
19
To begin a TigerSearch query:
20
<ul>
21
<li> right click on the "SRCMF2" corpus icon in the left panel.</li>
22
<li> select the "Tiger Search" menu item</li>
23
<li> A new search tab will open, and allow you to type TigerSearch queries</li>
24
<li> The "Search" button will run the query</li>
25
<li> The first matching tree will be displayed in the panel below</li>
26
</ul>
27
</p>
28

    
29
<p>
30
The central bar then allows you to navigate through the results.
31
<ul>
32
<li>the "|<" button shows the first match</li>
33
<li>the "<" button shows the previous match</li>
34
<li>the ">" button shows the next match</li>
35
<li>the ">|" button show the last match</li>
36
<li>the "Export" button opens a new window with the current tree, that you can save in a file</li>
37
</ul>
38
</p>
39

    
40
<p>
41
In the left panel, all the usual TXM commands are also available on the "SRCMF2" corpus (by right clicking on the corpus):
42
<ul>
43
<li> Dimensions: gives the number of tokens and the number of types of the corpus</li>
44
<li> Index: gives the frequency list of results of CQL patterns</li>
45
<li> Lexicon: gives the frequency list of word properties</li>
46
<li> Concordances: builds a KWIC concordance of the results of a CQL query</li>
47
<li> Tiger Search: displays the matching trees of a TIGERSearch query</li>
48
<li> Context: builds a concordance of the results of a CQL query</li>
49
<li> Edition: displays the corpus text</li>
50
</ul>
51
</p>
52

    
53
<h1>SRCMF corpus: TIGERSearch web interface</h1>
54

    
55
<h2>Query format</h2>
56

    
57
<p>Type query using standard TIGERSearch syntax in the format <tt>#node:[ properties ] operator #node:[ properties ]</tt>.
58
Full documentation for the query language may be found <a href = "http://www.ims.uni-stuttgart.de/projekte/TIGER/TIGERSearch/doc/html/QueryLanguage.html">here</a>;  
59
see also the <a href = "http://www.ims.uni-stuttgart.de/projekte/TIGER/TIGERSearch/doc/html/QueryLanguage_QuickReference.html">quick reference</a>.
60
</p>
61

    
62
<h2>Model semantics</h2>
63

    
64
<h3>Non-terminal nodes</h3>
65

    
66
<p>The corpus contains two main types of non-terminal node: <strong>heads</strong> and <strong>structures</strong>.  
67
These alternate in the hierarchy: <tt>structure >M head >D structure(s)... >M head</tt>.
68

    
69
<p>Each non-terminal node has a <tt>cat</tt> property.</p>
70
The <tt>cat</tt> property of a head gives its morphological type: 
71
<span title="Finite verb"><tt>VFin</tt></span>, 
72
<span title="Infinitive"><tt>VInf</tt></span>, 
73
<span title="Participle"><tt>VPar</tt></span>, 
74
<span title="Noun"><tt>Nom</tt></span>, 
75
<span title="Adjective"><tt>Adj</tt></span>, 
76
<span title="Adverb"><tt>Adv</tt></span>.
77
<p>The <tt>cat</tt> property of a structure node gives its function relative to the head on which it depends: 
78
<span title="Auxiliated non-finite verb (neither passive nor active)"><tt>Aux</tt></span>,
79
<span title="Auxiliated non-finite verb (active)"><tt>AuxA</tt></span>,
80
<span title="Auxiliated non-finite verb (passive)"><tt>AuxP</tt></span>,
81
<span title="Determiner"><tt>Det</tt></span>,
82
<span title="Modifier"><tt>ModA</tt></span>,
83
<span title="Dislocated modifier"><tt>ModD</tt></span>,
84
<span title="Object attribute"><tt>AtObj</tt></span>,
85
<span title="Subject attribute"><tt>AtSj</tt></span>,
86
<span title="Non-finite clause"><tt>Regim</tt></span>,
87
<span title="Complement"><tt>Cmpl</tt></span>,
88
<span title="Object"><tt>Obj</tt></span>,
89
<span title="Reflexive"><tt>Refl</tt></span>,
90
<span title="Impersonal / expletive subject"><tt>SjImp</tt></span>,
91
<span title="Personal subject"><tt>SjPer</tt></span>,
92
<span title="Adjunct"><tt>Circ</tt></span>,
93
<span title="Negative particle (e.g. pas, mie)"><tt>NgPrt</tt></span>,
94
<span title="Negative"><tt>Ng</tt></span>,
95
<span title="Apostrophe (vocative)"><tt>Apst</tt></span>,
96
<span title="Inserted clause"><tt>Insrt</tt></span>,
97
<span title="Interjection"><tt>Intj</tt></span>.</p>
98

    
99
<p>The head of each sentence (a finite verb) is the daughter of a <strong>maximal structure</strong> (<tt>cat = <span title="Sentence">Snt</span></tt>).</p>
100

    
101
<p>Relating words such a prepositions, conjunctions and subordinated conjunctions (<span title="Non-coordinating relator"><tt>RelNC</tt></span>,
102
<span title="Coordinating relator"><tt>RelC</tt></span>) are daughters of a structure node: structure >R relator.</p>
103

    
104
<p>Coordinated structures are linked to a <strong>group</strong> node by a secondary arc, labelled <tt>coord</tt>: group >~coord structure.</p>
105

    
106
<h3>Terminal nodes</h3>
107

    
108
<p><strong>Words</strong> are terminal nodes. Each terminal node has a <tt>word</tt> property, containing its lexical form, and a <tt>pos</tt> property, containing its part-of-speech (Cattex–TreeTagger).</p>
109

    
110
<h2>Sample queries</h2>
111
<p>
112
The following sample queries may be tested by copying and pasting into the query panel.
113
</p>
114
<p>Find all main clause verbs:<br/>
115
<tt>[cat = "Snt"] >M #vfin:[cat = "VFin"]</tt></p>
116

    
117
<p>Find all structures introduced by a preposition:<br/>
118
<tt>#n >R #relnc:[cat = "RelNC"]<br/>
119
&amp; #relnc >L [pos = /PRE.*/]</tt><br/>
120
</p>
121

    
122
<p>Find all post-verbal NP subjects:<br/>
123
<tt>#verb:[cat = "VFin"] >D #suj:[cat = "SjPer"]<br/>
124
&amp; #suj >M [cat = "Nom"]<br/>
125
&amp; #suj >@l #sword<br/>
126
&amp; #verb >L #vword<br/>
127
&amp; #vword .* #sword</tt></p>
128

    
129
<p>Find indefinite subjects introduced by <q>qui</q>:<br/>
130
<tt>[cat = "VFin"] >D #suj:[cat = "SjPer"]<br/>
131
&amp; #suj >R #relnc:[cat = "RelNC"]<br/>
132
&amp; ( #relnc >L [word = /[QqKk]u?i/]<br/>
133
| #relnc >~dupl [word = /[QqKk]u?i/] )</tt><br/></p>
134

    
135
<p>Find sentences with coordinated subjects:<br/>
136
<tt>#coo:[cat = "Coo"] >~coord #sj1:[cat = "SjPer"]<br/>
137
&amp; #coo >~coord #sj2:[cat = "SjPer"]<br/>
138
&amp; #sj1 $ #sj2</tt></p>
139

    
140
<p>Find sentences with possible <q>gapping</q> of the finite verb (i.e. coordination of subject–predicate pairs):<br/>
141
<tt>#gpcoo1:[cat = "GpCoo"] >~coord #suj1:[cat = "SjPer"]<br/>
142
&amp; #gpcoo1 $.* #gpcoo2:[cat = "GpCoo"]<br/>
143
&amp; #gpcoo2 >~coord #suj2:[cat = "SjPer"]<br/>
144
&amp; #gpcoo1 >~coord #pred1:[cat = /Cmpl|Obj|AtSj/]<br/>
145
&amp; #gpcoo2 >~coord #pred2:[cat = /Cmpl|Obj|AtSj/]<br/>
146
</p>
147

    
148
<h1> Useful links</h1>
149
<p>
150
<ul>
151
<li><a href = "https://listes.cru.fr/wiki/srcmf/index">SRCMF wiki</a></li>
152
<li><a href = "https://textometrie.svn.sourceforge.net/svnroot/textometrie/trunk/doc/Tutoriel%20TXM%20pour%20la%20BFM%20V0.9_Web.odt">TXM tutorial (FR)</a> </li>
153
<li><a href = "http://www.ims.uni-stuttgart.de/projekte/TIGER/TIGERSearch/oldindex.shtml">TIGERSearch project website</a></li>
154
<li><a href = "http://bfm.ens-lyon.fr/">BFM project website</a></li>
155
<li><a href = "http://textometrie.ens-lyon.fr/?lang=en">TXM project website</a></li>
156
</ul>
157
</p>
158
</div>
159
</body>
160
</html>