Example usage for weka.filters.unsupervised.attribute StringToWordVector setNormalizeDocLength

List of usage examples for weka.filters.unsupervised.attribute StringToWordVector setNormalizeDocLength

Introduction

In this page you can find the example usage for weka.filters.unsupervised.attribute StringToWordVector setNormalizeDocLength.

Prototype

public void setNormalizeDocLength(SelectedTag newType) 

Source Link

Document

Sets whether if the word frequencies for a document (instance) should be normalized or not.

Usage

From source file:nl.uva.expose.classification.WekaClassification.java

private void getWordVector(Instances dRaw, Instances dFiltered) throws Exception {
    StringToWordVector filter = new StringToWordVector();
    filter.setAttributeIndices("first-last");
    filter.setIDFTransform(true);//from w w w . j a v a 2  s  .c o  m
    filter.setLowerCaseTokens(true);
    filter.setMinTermFreq(2);
    filter.setLowerCaseTokens(true);
    filter.setNormalizeDocLength(
            new SelectedTag(StringToWordVector.FILTER_NORMALIZE_ALL, StringToWordVector.TAGS_FILTER));
    filter.setOutputWordCounts(true);
    //        filter.setTokenizer();
    //        filter.setWordsToKeep();
    filter.setInputFormat(dRaw);
    dFiltered = Filter.useFilter(dRaw, filter);
}