Example usage for Java edu.stanford.nlp.process PTBTokenizer fields, constructors, methods, implement or subclass
The text is from its open source code.
PTBTokenizer(final Reader r, final LexedTokenFactory Constructs a new PTBTokenizer with a custom LexedTokenFactory. |
TokenizerFactory | factory() This is a historical constructor that returns Word tokens. |
TokenizerFactory | factory(boolean tokenizeNLs, boolean invertible) |
TokenizerFactory | factory(LexedTokenFactory Get a TokenizerFactory that does Penn Treebank tokenization. |
String | getNewlineToken() Returns the string literal inserted for newlines when the -tokenizeNLs options is set. |
PTBTokenizer | newPTBTokenizer(Reader r) Constructs a new PTBTokenizer that returns Word tokens and which treats carriage returns as normal whitespace. |
PTBTokenizer | newPTBTokenizer(Reader r, boolean tokenizeNLs, boolean invertible) Constructs a new PTBTokenizer that makes CoreLabel tokens. |
String | ptb2Text(String ptbText) Returns a presentable version of the given PTB-tokenized text. |
String | ptb2Text(List Returns a presentable version of the given PTB-tokenized words. |
String | ptbToken2Text(String ptbText) Returns a presentable version of a given PTB token. |