Java org.apache.mahout.vectorizer DocumentProcessor fields, constructors, methods, implement or subclass

Example usage for Java org.apache.mahout.vectorizer DocumentProcessor fields, constructors, methods, implement or subclass

Introduction

In this page you can find the methods, fields and constructors for org.apache.mahout.vectorizer DocumentProcessor.

The text is from its open source code.

Field

Method

voidtokenizeDocuments(Path input, Class analyzerClass, Path output, Configuration baseConf)
Convert the input documents into token array using the StringTuple The input documents has to be in the org.apache.hadoop.io.SequenceFile format