List of usage examples for org.apache.mahout.vectorizer Vectorizer interface-usage
From source file com.elex.dmp.vectorizer.DictionaryVectorizer.java
/**
* This class converts a set of input documents in the sequence file format to vectors. The Sequence file
* input should have a {@link Text} key containing the unique document identifier and a {@link StringTuple}
* value containing the tokenized document. You may use {@link DocumentProcessor} to tokenize the document.
* This is a dictionary based Vectorizer.
*/
From source file com.elex.dmp.vectorizer.FixDictionaryVectorizer.java
/**
* This class converts a set of input documents in the sequence file format to vectors. The Sequence file
* input should have a {@link Text} key containing the unique document identifier and a {@link StringTuple}
* value containing the tokenized document. You may use {@link DocumentProcessor} to tokenize the document.
* This is a dictionary based Vectorizer.
*/