OpenNLP - интерфейс командной строки
OpenNLP предоставляет интерфейс командной строки (CLI) для выполнения различных операций через командную строку. В этой главе мы рассмотрим несколько примеров, чтобы показать, как мы можем использовать интерфейс командной строки OpenNLP.
Токенизация
input.txt
Hi. How are you? Welcome to Tutorialspoint. We provide free tutorials on various technologies
Синтаксис
> opennlp TokenizerME path_for_models../en-token.bin <inputfile..> outputfile..
команда
C:\> opennlp TokenizerME C:\OpenNLP_models/en-token.bin <input.txt >output.txt
вывод
Loading Tokenizer model ... done (0.207s)
Average: 214.3 sent/s
Total: 3 sent
Runtime: 0.014s
output.txt
Hi . How are you ? Welcome to Tutorialspoint . We provide free tutorials on various technologies
Обнаружение приговора
input.txt
Hi. How are you? Welcome to Tutorialspoint. We provide free tutorials on various technologies
Синтаксис
> opennlp SentenceDetector path_for_models../en-token.bin <inputfile..> outputfile..
команда
C:\> opennlp SentenceDetector C:\OpenNLP_models/en-sent.bin <input.txt > output_sendet.txt
Вывод
Loading Sentence Detector model ... done (0.067s)
Average: 750.0 sent/s
Total: 3 sent
Runtime: 0.004s
Output_sendet.txt
Hi. How are you?
Welcome to Tutorialspoint.
We provide free tutorials on various technologies
Признание именованных сущностей
input.txt
<START:person> <START:person> Mike <END> <END> is senior programming manager and
<START:person> Rama <END> is a clerk both are working at Tutorialspoint
Синтаксис
> opennlp TokenNameFinder path_for_models../en-token.bin <inputfile..
Команда
C:\>opennlp TokenNameFinder C:\OpenNLP_models\en-ner-person.bin <input_namefinder.txt
Вывод
Loading Token Name Finder model ... done (0.730s)
<START:person> <START:person> Mike <END> <END> is senior programming manager and
<START:person> Rama <END> is a clerk both are working at Tutorialspoint
Average: 55.6 sent/s
Total: 1 sent
Runtime: 0.018s
Части речевых тегов
Input.txt
Hi. How are you? Welcome to Tutorialspoint. We provide free tutorials on various technologies
Синтаксис
> opennlp POSTagger path_for_models../en-token.bin <inputfile..
Команда
C:\>opennlp POSTagger C:\OpenNLP_models/en-pos-maxent.bin < input.txt
Вывод
Loading POS Tagger model ... done (1.315s)
Hi._NNP How_WRB are_VBP you?_JJ Welcome_NNP to_TO Tutorialspoint._NNP We_PRP
provide_VBP free_JJ tutorials_NNS on_IN various_JJ technologies_NNS
Average: 66.7 sent/s
Total: 1 sent
Runtime: 0.015s