Example usage for the org.apache.mahout.vectorizer.Vectorizer interface

Introduction

On this page you can find example usages of the org.apache.mahout.vectorizer.Vectorizer interface.

Usage

From source file com.elex.dmp.vectorizer.DictionaryVectorizer.java

/**
 * This class converts a set of input documents in the sequence file format to vectors. The sequence file
 * input should have a {@link Text} key containing the unique document identifier and a {@link StringTuple}
 * value containing the tokenized document. You may use {@link DocumentProcessor} to tokenize the document.
 * This is a dictionary-based Vectorizer.
 */
 */

From source file com.elex.dmp.vectorizer.FixDictionaryVectorizer.java

/**
 * This class converts a set of input documents in the sequence file format to vectors. The sequence file
 * input should have a {@link Text} key containing the unique document identifier and a {@link StringTuple}
 * value containing the tokenized document. You may use {@link DocumentProcessor} to tokenize the document.
 * This is a dictionary-based Vectorizer.
 */
 */
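
The comments above describe dictionary-based vectorization of tokenized documents but do not show the idea itself. The following is a minimal standalone sketch of that idea in plain Java, not the Mahout or com.elex.dmp implementation: it uses no Hadoop sequence files, Text keys, or StringTuple values. The class name DictionaryVectorizerSketch and the methods buildDictionary and vectorize are hypothetical names introduced only for illustration, and the choice of raw term counts as vector weights is an assumption.

```java
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Illustrative sketch only; hypothetical names, not part of Mahout. */
final class DictionaryVectorizerSketch {

    /** Assigns each distinct term an integer index in order of first appearance. */
    static Map<String, Integer> buildDictionary(List<List<String>> tokenizedDocs) {
        Map<String, Integer> dictionary = new LinkedHashMap<>();
        for (List<String> doc : tokenizedDocs) {
            for (String term : doc) {
                // The next free index equals the current dictionary size.
                dictionary.putIfAbsent(term, dictionary.size());
            }
        }
        return dictionary;
    }

    /** Converts one tokenized document into a sparse vector: term index -> term count. */
    static Map<Integer, Integer> vectorize(List<String> tokens, Map<String, Integer> dictionary) {
        Map<Integer, Integer> vector = new TreeMap<>();
        for (String term : tokens) {
            Integer index = dictionary.get(term);
            if (index != null) { // terms missing from the dictionary are skipped
                vector.merge(index, 1, Integer::sum);
            }
        }
        return vector;
    }

    public static void main(String[] args) {
        // Each inner list stands in for one already-tokenized document.
        List<List<String>> docs = List.of(
                List.of("mahout", "vector", "mahout"),
                List.of("vector", "dictionary"));
        Map<String, Integer> dictionary = buildDictionary(docs);
        for (List<String> doc : docs) {
            System.out.println(vectorize(doc, dictionary));
        }
    }
}
```

Run as written, the sketch prints {0=2, 1=1} for the first document and {1=1, 2=1} for the second. A sparse map is used here because most documents contain only a small fraction of the dictionary's terms.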