|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |
java.lang.Object opennlp.tools.chunker.ChunkerME
public class ChunkerME
The class represents a maximum-entropy-based chunker. Such a chunker can be used to find flat structures based on sequence inputs such as noun phrases or named entities.
Field Summary | |
---|---|
protected BeamSearch<String> |
beam
The beam used to search for sequences of chunk tag assignments. |
static int |
DEFAULT_BEAM_SIZE
|
protected opennlp.model.MaxentModel |
model
The model used to assign chunk tags to a sequence of tokens. |
Constructor Summary | |
---|---|
ChunkerME(ChunkerModel model)
Initializes the current instance with the specified model. |
|
ChunkerME(ChunkerModel model,
int beamSize)
Initializes the current instance with the specified model and the specified beam size. |
|
ChunkerME(ChunkerModel model,
int beamSize,
SequenceValidator<String> sequenceValidator)
Initializes the current instance with the specified model and the specified beam size. |
|
ChunkerME(ChunkerModel model,
int beamSize,
SequenceValidator<String> sequenceValidator,
ChunkerContextGenerator contextGenerator)
Initializes the current instance with the specified model and the specified beam size. |
|
ChunkerME(opennlp.model.MaxentModel mod)
Deprecated. |
|
ChunkerME(opennlp.model.MaxentModel mod,
ChunkerContextGenerator cg)
Deprecated. |
|
ChunkerME(opennlp.model.MaxentModel mod,
ChunkerContextGenerator cg,
int beamSize)
Deprecated. |
Method Summary | |
---|---|
List<String> |
chunk(List<String> toks,
List<String> tags)
Generates chunk tags for the given sequence returning the result in a list. |
String[] |
chunk(String[] toks,
String[] tags)
Generates chunk tags for the given sequence returning the result in an array. |
static void |
main(String[] args)
Deprecated. |
double[] |
probs()
Returns an array with the probabilities of the last decoded sequence. |
void |
probs(double[] probs)
Populates the specified array with the probabilities of the last decoded sequence. |
Sequence[] |
topKSequences(List<String> sentence,
List<String> tags)
Returns the top k chunk sequences for the specified sentence with the specified pos-tags |
Sequence[] |
topKSequences(String[] sentence,
String[] tags,
double minSequenceScore)
Returns the top k chunk sequences for the specified sentence with the specified pos-tags |
static ChunkerModel |
train(String lang,
ObjectStream<ChunkSample> in,
int cutoff,
int iterations)
Trains a new model for the ChunkerME . |
static ChunkerModel |
train(String lang,
ObjectStream<ChunkSample> in,
int cutoff,
int iterations,
ChunkerContextGenerator contextGenerator)
|
Methods inherited from class java.lang.Object |
---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Field Detail |
---|
public static final int DEFAULT_BEAM_SIZE
protected BeamSearch<String> beam
protected opennlp.model.MaxentModel model
Constructor Detail |
---|
public ChunkerME(ChunkerModel model, int beamSize, SequenceValidator<String> sequenceValidator, ChunkerContextGenerator contextGenerator)
model
- The model for this chunker.cacheSize
- beamSize
- The size of the beam that should be used when decoding sequences.sequenceValidator
- The SequenceValidator
to determines whether the outcome
is valid for the preceding sequence. This can be used to implement constraints
on what sequences are valid.public ChunkerME(ChunkerModel model, int beamSize, SequenceValidator<String> sequenceValidator)
model
- The model for this chunker.beamSize
- The size of the beam that should be used when decoding sequences.sequenceValidator
- The SequenceValidator
to determines whether the outcome
is valid for the preceding sequence. This can be used to implement constraints
on what sequences are valid.public ChunkerME(ChunkerModel model, int beamSize)
model
- The model for this chunker.cacheSize
- beamSize
- The size of the beam that should be used when decoding sequences.public ChunkerME(ChunkerModel model)
model
- @Deprecated public ChunkerME(opennlp.model.MaxentModel mod)
mod
- The maximum entropy model for this chunker.@Deprecated public ChunkerME(opennlp.model.MaxentModel mod, ChunkerContextGenerator cg)
mod
- The maximum entropy model for this chunker.cg
- The context generator to be used by the specified model.@Deprecated public ChunkerME(opennlp.model.MaxentModel mod, ChunkerContextGenerator cg, int beamSize)
mod
- The maximum entropy model for this chunker.cg
- The context generator to be used by the specified model.beamSize
- The size of the beam that should be used when decoding sequences.Method Detail |
---|
public List<String> chunk(List<String> toks, List<String> tags)
Chunker
chunk
in interface Chunker
toks
- a list of the tokens or words of the sequence.tags
- a list of the pos tags of the sequence.
public String[] chunk(String[] toks, String[] tags)
Chunker
chunk
in interface Chunker
toks
- an array of the tokens or words of the sequence.tags
- an array of the pos tags of the sequence.
public Sequence[] topKSequences(List<String> sentence, List<String> tags)
Chunker
topKSequences
in interface Chunker
sentence
- The tokens of the sentence.tags
- The pos-tags for the specified sentence.
public Sequence[] topKSequences(String[] sentence, String[] tags, double minSequenceScore)
Chunker
topKSequences
in interface Chunker
sentence
- The tokens of the sentence.tags
- The pos-tags for the specified sentence.
public void probs(double[] probs)
chunk
. The
specified array should be at least as large as the numbe of tokens in the previous call to chunk
.
probs
- An array used to hold the probabilities of the last decoded sequence.public double[] probs()
chunk
.
chunk
when it was last called.public static ChunkerModel train(String lang, ObjectStream<ChunkSample> in, int cutoff, int iterations, ChunkerContextGenerator contextGenerator) throws IOException
IOException
public static ChunkerModel train(String lang, ObjectStream<ChunkSample> in, int cutoff, int iterations) throws IOException, ObjectStreamException
ChunkerME
.
es
- iterations
- cutoff
-
IOException
ObjectStreamException
@Deprecated public static void main(String[] args) throws IOException, ObjectStreamException
args
- The training file and the model file.
IOException
- When the specified files can not be read.
ObjectStreamException
|
||||||||||
PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD |