| Class | Description |
|---|---|
| CachedFile | |
| POSTag | The OpenNLP POSTag UDF tags bags of sequential words with parts of speech and confidence levels using the OpenNLP toolset, specifically the POSTaggerME class. |
| SentenceDetect | The OpenNLP SentenceDetectors segment an input paragraph into sentences. |
| TokenizeME | The OpenNLP Tokenizers segment an input character sequence into tokens using the OpenNLP TokenizerME class, which is a probabilistic, 'maximum entropy' classifier. |
| TokenizeSimple | The OpenNLP Tokenizers segment an input character sequence into tokens. This variant uses the OpenNLP SimpleTokenizer class, which splits on changes of character class. |
| TokenizeWhitespace | The OpenNLP Tokenizers segment an input character sequence into tokens. This variant uses the OpenNLP WhitespaceTokenizer class, which splits on whitespace only. |
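
The TokenizeSimple and TokenizeWhitespace entries describe the same task, so a small illustration of how the two strategies their names suggest can differ may help. The following is a minimal, dependency-free sketch and not the OpenNLP or UDF implementation: the class `TokenizerSketch` and its methods `whitespace` and `byCharClass` are hypothetical names, and the character-class rule is a simplified assumption (letters, digits, and every other non-whitespace character each form their own kind of token).

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical illustration only; not part of OpenNLP or the UDF package.
public class TokenizerSketch {

    // Whitespace tokenization: split on runs of whitespace and keep
    // everything between them intact, punctuation included.
    static List<String> whitespace(String text) {
        List<String> tokens = new ArrayList<>();
        for (String piece : text.trim().split("\\s+")) {
            if (!piece.isEmpty()) {
                tokens.add(piece);
            }
        }
        return tokens;
    }

    // Character classes assumed by the character-class tokenizer below.
    private enum CharClass { WHITESPACE, LETTER, DIGIT, OTHER }

    private static CharClass classify(char c) {
        if (Character.isWhitespace(c)) return CharClass.WHITESPACE;
        if (Character.isLetter(c)) return CharClass.LETTER;
        if (Character.isDigit(c)) return CharClass.DIGIT;
        return CharClass.OTHER;
    }

    // Character-class tokenization: start a new token whenever the class
    // of the current character differs from the previous one. In this
    // simplified sketch every OTHER character becomes its own token.
    static List<String> byCharClass(String text) {
        List<String> tokens = new ArrayList<>();
        StringBuilder current = new StringBuilder();
        CharClass previous = CharClass.WHITESPACE;
        for (char c : text.toCharArray()) {
            CharClass cls = classify(c);
            boolean boundary = cls != previous || cls == CharClass.OTHER;
            if (boundary && current.length() > 0) {
                tokens.add(current.toString());
                current.setLength(0);
            }
            if (cls != CharClass.WHITESPACE) {
                current.append(c);
            }
            previous = cls;
        }
        if (current.length() > 0) {
            tokens.add(current.toString());
        }
        return tokens;
    }

    public static void main(String[] args) {
        String text = "Hi, you2!";
        System.out.println(whitespace(text));
        System.out.println(byCharClass(text));
    }
}
```

Under these assumptions, the whitespace strategy keeps punctuation attached to neighboring words, while the character-class strategy separates it. A trained model such as the one behind TokenizeME would instead decide token boundaries probabilistically.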