Class | Description |
---|---|
CachedFile | |
POSTag |
The OpenNLP POSTag UDF tags bags of sequential words with parts of speech and confidence levels using the OpenNLP
toolset, and specifically the POSTaggerME class.
|
SentenceDetect |
The OpenNLP SentenceDectectors segment an input paragraph into sentences.
|
TokenizeME |
The OpenNLP Tokenizers segment an input character sequence into tokens using the OpenNLP TokenizeME class, which is
a probabilistic, 'maximum entropy' classifier.
|
TokenizeSimple |
The OpenNLP Tokenizers segment an input character sequence into tokens.
|
TokenizeWhitespace |
The OpenNLP Tokenizers segment an input character sequence into tokens.
|