Java edu.stanford.nlp.process PTBTokenizer fields, constructors, methods, implement or subclass

Example usage for Java edu.stanford.nlp.process PTBTokenizer fields, constructors, methods, implement or subclass

Introduction

In this page you can find the methods, fields and constructors for edu.stanford.nlp.process PTBTokenizer.

The text is from its open source code.

	PTBTokenizer(final Reader r, final LexedTokenFactory tokenFactory, final String options) Constructs a new PTBTokenizer with a custom LexedTokenFactory.

TokenizerFactory	factory() This is a historical constructor that returns Word tokens.
TokenizerFactory	factory(boolean tokenizeNLs, boolean invertible)
TokenizerFactory	factory(LexedTokenFactory factory, String options) Get a TokenizerFactory that does Penn Treebank tokenization.
String	getNewlineToken() Returns the string literal inserted for newlines when the -tokenizeNLs options is set.
PTBTokenizer	newPTBTokenizer(Reader r) Constructs a new PTBTokenizer that returns Word tokens and which treats carriage returns as normal whitespace.
PTBTokenizer	newPTBTokenizer(Reader r, boolean tokenizeNLs, boolean invertible) Constructs a new PTBTokenizer that makes CoreLabel tokens.
String	ptb2Text(String ptbText) Returns a presentable version of the given PTB-tokenized text.
String	ptb2Text(List ptbWords) Returns a presentable version of the given PTB-tokenized words.
String	ptbToken2Text(String ptbText) Returns a presentable version of a given PTB token.