root / tmp / org.txm.treetagger.core.win32 / res / win32 / INSTALL.txt @ 1670
History | View | Annotate | Download (3.1 kB)
1 |
|
---|---|
2 |
/****************************************************************************/ |
3 |
/* How to install the Windows version of the TreeTagger */ |
4 |
/* */ |
5 |
/* Author: Helmut Schmid, CIS, Ludwig-Maximilians-Universität, Germany */ |
6 |
/****************************************************************************/ |
7 |
|
8 |
This is the Windows distribution of the TreeTagger. |
9 |
|
10 |
It contains the following files: |
11 |
|
12 |
- tree-tagger.exe: the tagger program |
13 |
|
14 |
- train-tree-tagger.exe: the training program |
15 |
|
16 |
- utf8-tokenize.perl: A Perl script which transforms the tagger input |
17 |
into one-word-perl-line format |
18 |
|
19 |
- *-abbreviations: abbreviation lists required by the tokenizer |
20 |
|
21 |
- tag-*.bat: batch files for different languages which call |
22 |
the tokeniser and the tagger |
23 |
|
24 |
- chunk-*.bat batch files for POS tagging and chunking |
25 |
|
26 |
|
27 |
Installation |
28 |
------------ |
29 |
|
30 |
1. Install a Perl interpreter (if you have not already installed one). |
31 |
You can download a Perl interpreter for Windows for free at |
32 |
http://www.activestate.com/activeperl/ |
33 |
|
34 |
2. Move the TreeTagger directory to the root directory of drive C:. |
35 |
|
36 |
3. Download the PC parameter files for the languages you need, decompress |
37 |
them (e.g. using Winzip or 7zip) and move them to the subdirectory lib. |
38 |
Rename the parameter files to <language>-utf8.par |
39 |
Example: Rename french-par-linux-3.2-utf8.bin to french-utf8.par |
40 |
|
41 |
Non-UTF8 parameter files are not supported anymore. |
42 |
|
43 |
4. Add the path C:\TreeTagger\bin to the PATH environment variable. |
44 |
|
45 |
5. Open a shell and type the command |
46 |
set PATH=C:\TreeTagger\bin;%PATH% |
47 |
|
48 |
6. Change to the directory C:\TreeTagger |
49 |
|
50 |
7. Now you can test the tagger, e.g. by analyzing this file with the command |
51 |
tag-english INSTALL.txt |
52 |
|
53 |
If you install the TreeTagger in a different directory, you have to |
54 |
modify the first path in the batch files tag-*.bat. |
55 |
|
56 |
|
57 |
Michaela Atterer told me that she had difficulties to install the |
58 |
TreeTagger on a Windows XP system. She recommends the following |
59 |
work-around. |
60 |
|
61 |
4. Windows XP: |
62 |
-Right click on "My Computer" |
63 |
-Select the "Advanced" tab |
64 |
-Click on "Environment Variables" |
65 |
-click on New: enter PATH and C:\TreeTagger\bin\;%PATH% |
66 |
|
67 |
If the files have been unpacked into a single directory, you should |
68 |
restore the following directory structure: |
69 |
|
70 |
TreeTagger: |
71 |
INSTALL.txt README.txt bin cmd lib |
72 |
|
73 |
TreeTagger/bin: |
74 |
tag-english.bat tag-german.bat tag-spanish.bat tree-tagger.exe |
75 |
tag-french.bat tag-italian.bat train-tree-tagger.exe |
76 |
|
77 |
TreeTagger/cmd: |
78 |
mwl-lookup.perl tokenize.pl |
79 |
|
80 |
TreeTagger/lib: |
81 |
english-abbreviations german-abbreviations spanish-abbreviations |
82 |
french-abbreviations italian-abbreviations spanish-mwls |
83 |
|
84 |
|
85 |
Note that the TreeTagger comes without a graphical interface. You have |
86 |
to run it by entering a command in a command line window. If you prefer |
87 |
a graphical interface, try the one provided by Ciarán Ó Duibhín at |
88 |
http://www.smo.uhi.ac.uk/~oduibhin/oideasra/interfaces/winttinterface.htm |
89 |
|