【Question title】: TreeTagger installation successful but cannot open .par file
【Posted】: 2013-03-08 08:34:47
【Question】:

Does anyone know how to resolve this file-reading error in TreeTagger, a widely used natural language processing tool for POS tagging, lemmatization, and sentence chunking?

alvas@ikoma:~/treetagger$ echo 'Hello world!' | cmd/tree-tagger-english 
        reading parameters ...

ERROR: Can't open for reading: /home/alvas/treetagger/lib/english.par
aborted.

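I did not run into any of the possible installation problems listed in http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/installation-hints.txt. I followed the instructions on the web page (http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/#Linux), and the installation reported success: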

alvas@ikoma:~$ mkdir treetagger
alvas@ikoma:~$ cd treetagger
alvas@ikoma:~/treetagger$ wget ftp://ftp.ims.uni-stuttgart.de/pub/corpora/tree-tagger-linux-3.2.tar.gz
alvas@ikoma:~/treetagger$ wget ftp://ftp.ims.uni-stuttgart.de/pub/corpora/tagger-scripts.tar.gz
alvas@ikoma:~/treetagger$ wget ftp://ftp.ims.uni-stuttgart.de/pub/corpora/install-tagger.sh
alvas@ikoma:~/treetagger$ wget ftp://ftp.ims.uni-stuttgart.de/pub/corpora/dutch-par-linux-3.2-utf8.bin.gz
alvas@ikoma:~/treetagger$ wget ftp://ftp.ims.uni-stuttgart.de/pub/corpora/german-par-linux-3.2-utf8.bin.gz
alvas@ikoma:~/treetagger$ wget ftp://ftp.ims.uni-stuttgart.de/pub/corpora/italian-par-linux-3.2-utf8.bin.gz
alvas@ikoma:~/treetagger$ wget ftp://ftp.ims.uni-stuttgart.de/pub/corpora/spanish-par-linux-3.2-utf8.bin.gz
alvas@ikoma:~/treetagger$ wget ftp://ftp.ims.uni-stuttgart.de/pub/corpora/french-par-linux-3.2-utf8.bin.gz

alvas@ikoma:~/treetagger$ sh install-tagger.sh 

Linux version of TreeTagger installed.
Tagging scripts installed.
German parameter file (Linux, UTF8) installed.
German chunker parameter file (Linux) installed.
French parameter file (Linux, UTF8) installed.
French chunker parameter file (Linux, UTF8) installed.
Italian parameter file (Linux, UTF8) installed.
Spanish parameter file (Linux, UTF8) installed.
Dutch parameter file (Linux, UTF8) installed.
Path variables modified in tagging scripts.

You might want to add /home/alvas/treetagger/cmd and /home/alvas/treetagger/bin to the PATH variable so that you do not need to specify the full path to run the tagging scripts.

But when I try to test the software, I get these errors:

alvas@ikoma:~/treetagger$ echo 'Hello world!' | cmd/tree-tagger-english 
    reading parameters ...

ERROR: Can't open for reading: /home/alvas/treetagger/lib/english.par
aborted.
alvas@ikoma:~/treetagger$ echo 'Das ist ein Test.' | cmd/tagger-chunker-german

ERROR: Can't open for reading: /home/alvas/treetagger/lib/german-chunker.par
aborted.

ERROR: Can't open for reading: /home/alvas/treetagger/lib/german.par
aborted.
    reading parameters ...

ERROR: Can't open for reading: /home/alvas/treetagger/lib/german.par
aborted.

【Comments】:

    Tags: installation nlp stemming pos-tagger lemmatization


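    【Answer 1】:

    I had the same problem. I realized that the .par files I had downloaded for the languages I needed had not been extracted (they were still in .gz form).

    Make sure they are extracted into the directory first, then try again.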
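
If an archive really is still compressed, the extraction can be done by hand. The following is a minimal sketch, not part of TreeTagger: `unpack_par` is a hypothetical helper, and the destination file name is an assumption that should be replaced with whatever name the tagging script you run reports in its error message.

```shell
# Sketch: decompress one downloaded parameter archive and confirm the result is non-empty.
# unpack_par is a hypothetical helper, not something shipped with TreeTagger.
unpack_par() {
    archive=$1   # e.g. german-par-linux-3.2-utf8.bin.gz
    dest=$2      # target .par path (assumed name; match what your script expects)
    if [ ! -r "$archive" ]; then
        echo "archive not found: $archive" >&2
        return 1
    fi
    mkdir -p "$(dirname "$dest")"
    # -c writes to stdout and -d decompresses, so the original .gz is left untouched
    gzip -cd "$archive" > "$dest" || return 1
    [ -s "$dest" ] && echo "unpacked: $dest"
}
```

From the install directory it could be called as `unpack_par german-par-linux-3.2-utf8.bin.gz lib/german-utf8.par`, where the second argument is only an illustrative name.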

    【Comments】:

      【Answer 2】:

      You wrote cmd/tree-tagger-english, but I think the correct path (where the parameter files are) is:

      lib/tree-tagger-english

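      【Comments】:

        【Answer 3】:

        I think there are two problems. First, the script name should contain "-utf8", e.g. cmd/tagger-chunker-german-utf8, because you downloaded UTF-8 data. Second, tagging and chunking each need their own data file. See the home page, which has the sections "Parameter files for PC" and "Chunker parameter files for PC"; download the files from both sections, then rerun install-tagger.sh.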
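
One way to see which of the two problems applies is to compare the parameter files a script mentions with what is actually in lib/. The sketch below assumes the tagging scripts are plain shell scripts that name their parameter files with a `.par` suffix; `check_script_pars` is a hypothetical helper, and `grep -o` is a widely available extension rather than strict POSIX.

```shell
# Sketch: list every .par file name mentioned in a tagging script and
# report whether a readable file of that name exists in the lib directory.
# check_script_pars is a hypothetical helper, not part of TreeTagger.
check_script_pars() {
    script=$1    # e.g. cmd/tagger-chunker-german-utf8
    libdir=$2    # e.g. lib
    # Pull out bare file names such as german-chunker.par, dropping any path prefix
    grep -Eo '[A-Za-z0-9_-]+\.par' "$script" | sort -u | while read -r name; do
        if [ -r "$libdir/$name" ]; then
            echo "found   $name"
        else
            echo "missing $name"
        fi
    done
}
```

Running `check_script_pars cmd/tagger-chunker-german lib` and then the same for the "-utf8" variant should show whether the mismatch is in the script name or in a parameter file that was never downloaded.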

        【Comments】:
