【工具】【parse】berkeleyparser

安裝

不需要安裝,直接從github上download下來就行
https://github.com/slavpetrov/berkeleyparser

使用方法

java -jar BerkeleyParser-1.7.jar -gr <grammar>

<grammar>

然后就啟動了,你輸入一句話,它會返回這句話的parse結果。

查看更多參數(shù)

java -jar BerkeleyParser-1.7.jar

-render                       Write rendered tree to image file. (Default: false)
-maxLength                    Maximum sentence length (Default = 200).
-binarize                     Output binarized trees. (Default: false)
-variational                  Use variational rule score approximation instead of max-rule (Default: false)
-useGoldPOS                   Read data in CoNLL format, including gold part of speech tags.
-dumpPosteriors               Dump max-rule posteriors to disk.
-ec_format                    Use Eugene Charniak's input and output format.
-nThreads                     Parse in parallel using n threads (Default: 1).
-modelScore                   Output effective model score (max rule score for max rule parser) (Default: false)
-keepFunctionLabels           Retain predicted function labels. Model must have been trained with function labels. (Default: false)
-nGrammars                    Use a product model based on that many grammars
-chinese                      Enable some Chinese specific features in the lexicon.
-scores                       Output inside scores (only for binarized viterbi trees). (Default: false)
-tokenize                     Tokenize input first. (Default: false=text is already tokenized)
-substates                    Output subcategories (only for binarized viterbi trees). (Default: false)
-outputFile                   Store output in this file instead of printing it to STDOUT.
-confidence                   Output confidence measure, i.e. likelihood of tree given words: P(T|w) (Default: false)
-gr                           Grammarfile (Required)
 [required]
-accurate                     Set thresholds for accuracy. (Default: set thresholds for efficiency)
-inputFile                    Read input from this file instead of reading it from STDIN.
-kbest                        Output the k best parse max-rule trees (Default: 1).
-tree_likelihood              Output joint likelihood of tree and words: P(t,w) (Default: false)
-viterbi                      Compute viterbi derivation instead of max-rule tree (Default: max-rule)
-sentence_likelihood          Output sentence likelihood, i.e. summing out all parse trees: P(w) (Default: false)
-h                            help message

我的使用方法

import os

def berkeley_parse(input_file, output_file, berkeley_jar='./berkeleyparser/BerkeleyParser-1.7.jar', gr='./berkeleyparser/eng_sm6.gr'):
    os.system('java -jar ' + berkeley_jar + ' -gr ' + gr + ' -inputFile ' + input_file + ' -outputFile ' + output_file)
berkeley_parse(input_file, output_file)
最后編輯于
?著作權歸作者所有,轉載或內容合作請聯(lián)系作者
【社區(qū)內容提示】社區(qū)部分內容疑似由AI輔助生成,瀏覽時請結合常識與多方信息審慎甄別。
平臺聲明:文章內容(如有圖片或視頻亦包括在內)由作者上傳并發(fā)布,文章內容僅代表作者本人觀點,簡書系信息發(fā)布平臺,僅提供信息存儲服務。

友情鏈接更多精彩內容