Example usage of the org.apache.lucene.analysis.hu.HungarianAnalyzer constructor

Introduction

This page collects usage examples of the HungarianAnalyzer constructor from the org.apache.lucene.analysis.hu package.

Prototype

public HungarianAnalyzer(CharArraySet stopwords, CharArraySet stemExclusionSet) 

Document

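Builds an analyzer with the given stop words. If a non-empty stem exclusion set is provided, terms in that set are marked as keywords and are not stemmed.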
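To make the roles of the two constructor arguments concrete, here is a plain-Java sketch that models their effect without depending on Lucene. It is an illustration only: the class StopAndStemSketch, the methods analyze and toyStem, and the three-suffix stemming rule are all hypothetical stand-ins, not Lucene's tokenizer or its Hungarian stemmer. The sketch assumes the analyzer lowercases terms, drops stop words, and then stems every remaining term that is not in the exclusion set.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical model of "stop words + stem exclusion"; not Lucene code.
class StopAndStemSketch {

    // Toy stand-in for a stemmer: strips one plural-like suffix from longer words.
    // The real Hungarian stemmer is far more elaborate.
    static String toyStem(String term) {
        for (String suffix : new String[] {"ak", "ek", "ok"}) {
            if (term.length() > 4 && term.endsWith(suffix)) {
                return term.substring(0, term.length() - suffix.length());
            }
        }
        return term;
    }

    // Splits on non-letters, lowercases, removes stop words, then stems
    // every term that is not listed in stemExclusionSet.
    static List<String> analyze(String text, Set<String> stopwords, Set<String> stemExclusionSet) {
        List<String> out = new ArrayList<>();
        for (String raw : text.split("[^\\p{L}]+")) {
            if (raw.isEmpty()) {
                continue; // split() can yield a leading empty token
            }
            String term = raw.toLowerCase(Locale.ROOT);
            if (stopwords.contains(term)) {
                continue; // stop words never reach the stemming step
            }
            out.add(stemExclusionSet.contains(term) ? term : toyStem(term));
        }
        return out;
    }

    public static void main(String[] args) {
        Set<String> stopwords = Set.of("a", "az", "egy");
        String text = "Az emberek a nagy asztalok mellett";
        // Empty exclusion set: every surviving term is stemmed.
        System.out.println(analyze(text, stopwords, Set.of()));
        // "emberek" is excluded from stemming and passes through unchanged.
        System.out.println(analyze(text, stopwords, Set.of("emberek")));
    }
}
```

Because the model lowercases terms before the lookups, entries in both sets should be lowercase. With Lucene itself, the two sets would be CharArraySet instances passed to the constructor shown in the prototype.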

Usage

From source file: org.elasticsearch.analysis.common.HungarianAnalyzerProvider.java

License: Apache License

HungarianAnalyzerProvider(IndexSettings indexSettings, Environment env, String name, Settings settings) {
    super(indexSettings, name, settings);
    analyzer = new HungarianAnalyzer(
            Analysis.parseStopWords(env, settings, HungarianAnalyzer.getDefaultStopSet()),
            Analysis.parseStemExclusion(settings, CharArraySet.EMPTY_SET));
    analyzer.setVersion(version);
}

From source file: org.omegat.tokenizer.LuceneHungarianTokenizer.java

License: Open Source License

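// Note: this example targets an older Lucene API in which the constructor takes
// a Version and a stop-word Set rather than the two CharArraySet arguments of the prototype.
@Override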
protected TokenStream getTokenStream(final String strOrig, final boolean stemsAllowed,
        final boolean stopWordsAllowed) {
    if (stemsAllowed) {
        Set<?> stopWords = stopWordsAllowed ? HungarianAnalyzer.getDefaultStopSet() : Collections.EMPTY_SET;
        return new HungarianAnalyzer(getBehavior(), stopWords).tokenStream("", new StringReader(strOrig));
    } else {
        return new StandardTokenizer(getBehavior(), new StringReader(strOrig));
    }
}