Models for 1.5 series

 
 

Use the links in the table below to download the pre-trained models for the OpenNLP 1.5 series.

The models are language dependent and only perform well if the model language matches the language of the input text. Also make sure the input text is decoded correctly, depending on the input file encoding this can only be done by explicitly specifying the character encoding. See this Java Tutorial section for further details.

Note: All models are zip compressed (like a jar file), they must not be uncompressed.

Language Component Description Download
da Tokenizer Trained on conllx ddt data. da-token.bin
da Sentence Detector Trained on conllx ddt data. da-sent.bin
da Part of Speech Tagger Maxent model trained on conllx ddt data. da-pos-maxent.bin
da POS Tagger Perceptron model trained on conllx ddt data. da-pos-perceptron.bin
de Tokenizer Trained on tiger data. de-token.bin
de Sentence Detector Trained on tiger data. de-sent.bin
de POS Tagger Maxent model trained on tiger corpus. de-pos-maxent.bin
de POS Tagger Perceptron model trained on tiger corpus. de-pos-perceptron.bin
en Tokenizer Trained on opennlp training data. en-token.bin
en Sentence Detector Trained on opennlp training data. en-sent.bin
en POS Tagger Maxent model with tag dictionary. en-pos-maxent.bin
en POS Tagger Perceptron model with tag dictionary. en-pos-perceptron.bin
en Name Finder Date name finder model. en-ner-date.bin
en Name Finder Location name finder model. en-ner-location.bin
en Name Finder Money name finder model. en-ner-money.bin
en Name Finder Organization name finder model. en-ner-organization.bin
en Name Finder Percentage name finder model. en-ner-percentage.bin
en Name Finder Person name finder model. en-ner-person.bin
en Name Finder Time name finder model. en-ner-time.bin
en Chunker Trained on conll2000 shared task data. en-chunker.bin
en Parser en-parser-chunking.bin
en Coreference coref
es Name Finder Person name finder model. Trained on conll02 shared task data. es-ner-person.bin
es Name Finder Organization name finder model. Trained on conll02 shared task data. es-ner-organization.bin
es Name Finder Location name finder model. Trained on conll02 shared task data. es-ner-location.bin
es Name Finder Misc name finder model. Trained on conll02 shared task data. es-ner-misc.bin
nl Tokenizer Trained on conllx alpino data. nl-token.bin
nl Sentence Detector Trained on conllx alpino data. nl-sent.bin
nl Name Finder Person name finder model. Trained on conll02 shared task data. nl-ner-person.bin
nl Name Finder Organization name finder model. Trained on conll02 shared task data. nl-ner-organization.bin
nl Name Finder Location name finder model. Trained on conll02 shared task data. nl-ner-location.bin
nl Name Finder Misc name finder model. Trained on conll02 shared task data. nl-ner-misc.bin
nl POS Tagger Maxent model trained on conllx alpino data. nl-pos-maxent.bin
nl POS Tagger Perceptron model trained on conllx alpino data. nl-pos-perceptron.bin
pt Tokenizer Trained on conllx bosque data. pt-token.bin
pt Sentence Detector Trained on conllx bosque data. pt-sent.bin
pt POS Tagger Maxent model trained on conllx bosque data. pt-pos-maxent.bin
pt POS Tagger Perceptron model trained on conllx bosque data. pt-pos-perceptron.bin
se Tokenizer Trained on conllx talbanken05 data. se-token.bin
se Sentence Detector Trained on conllx talbanken05 data. se-sent.bin
se POS Tagger Maxent model trained on conllx talbanken05 data. se-pos-maxent.bin
se POS Tagger Perceptron model trained on conllx talbanken05 data. se-pos-perceptron.bin