Example usage for org.apache.lucene.analysis.pt PortugueseAnalyzer PortugueseAnalyzer

List of usage examples for org.apache.lucene.analysis.pt PortugueseAnalyzer PortugueseAnalyzer

Introduction

In this page you can find the example usage for org.apache.lucene.analysis.pt PortugueseAnalyzer PortugueseAnalyzer.

Prototype

public PortugueseAnalyzer(CharArraySet stopwords, CharArraySet stemExclusionSet) 

Source Link

Document

Builds an analyzer with the given stop words.

Usage

From source file:org.elasticsearch.analysis.common.PortugueseAnalyzerProvider.java

License:Apache License

PortugueseAnalyzerProvider(IndexSettings indexSettings, Environment env, String name, Settings settings) {
    super(indexSettings, name, settings);
    analyzer = new PortugueseAnalyzer(
            Analysis.parseStopWords(env, settings, PortugueseAnalyzer.getDefaultStopSet()),
            Analysis.parseStemExclusion(settings, CharArraySet.EMPTY_SET));
    analyzer.setVersion(version);/*w w  w . ja  v  a  2  s  .c om*/
}

From source file:org.omegat.tokenizer.LucenePortugueseTokenizer.java

License:Open Source License

@Override
protected TokenStream getTokenStream(final String strOrig, final boolean stemsAllowed,
        final boolean stopWordsAllowed) {
    if (stemsAllowed) {
        Set<?> stopWords = stopWordsAllowed ? PortugueseAnalyzer.getDefaultStopSet() : Collections.EMPTY_SET;
        return new PortugueseAnalyzer(getBehavior(), stopWords).tokenStream("", new StringReader(strOrig));
    } else {/*from  ww  w . j ava2s . c  o  m*/
        return new StandardTokenizer(getBehavior(), new StringReader(strOrig));
    }
}