Statistics
| Revision:

root / tmp / org.txm.core / bin / filters / Tokeniser / test.xml @ 54

History | View | Annotate | Download (602 Bytes)

1
<?xml version="1.0" encoding="UTF-8"?>
2
<text><!--Fichier XML de test du tokenizer XML de TXM. Chaque div contient un cas particulier que le tokenizer doit gérer-->
3
    <div id="ignore element"></div>
4
    <div id="delete element"></div>    
5
    <div id="delete element and its content"></div>
6
    <div id="replace newlines with spaces"></div>
7
    <div id="delete Ctrl (C class) characters"></div>
8
    <div id="split using whitespaces regexp"></div>
9
    <div id="fclitics"></div>
10
    <div id="elision"></div>
11
    <div id="..."></div>
12
    <div id="punct"></div>
13
    <div id="other things"></div>
14
</text>