Example usage for org.apache.lucene.analysis.core UpperCaseFilter UpperCaseFilter

Introduction

On this page you can find example usages of the UpperCaseFilter(TokenStream) constructor of org.apache.lucene.analysis.core.UpperCaseFilter.

Prototype

public UpperCaseFilter(TokenStream in) 

Document

Creates a new UpperCaseFilter that normalizes token text to upper case.
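
To make the effect concrete, here is a minimal plain-Java sketch of what upper-casing each token amounts to. It does not use Lucene: the class UpperCaseSketch and its helper methods are hypothetical names introduced for illustration, the whitespace splitting only stands in for a tokenizer, and the per-code-point mapping with Character.toUpperCase is an assumption about how the normalization behaves, not a copy of the filter's code.

```java
import java.util.ArrayList;
import java.util.List;

public class UpperCaseSketch {

    // Hypothetical helper (not part of Lucene): upper-cases a single token
    // one code point at a time, so supplementary characters are handled too.
    public static String upperCaseToken(String token) {
        StringBuilder out = new StringBuilder(token.length());
        for (int i = 0; i < token.length(); ) {
            int codePoint = token.codePointAt(i);
            out.appendCodePoint(Character.toUpperCase(codePoint));
            i += Character.charCount(codePoint);
        }
        return out.toString();
    }

    // Hypothetical stand-in for a tokenizer followed by the filter:
    // split on whitespace, then upper-case every token.
    public static List<String> upperCaseTokens(String text) {
        List<String> tokens = new ArrayList<>();
        for (String token : text.trim().split("\\s+")) {
            if (!token.isEmpty()) {
                tokens.add(upperCaseToken(token));
            }
        }
        return tokens;
    }

    public static void main(String[] args) {
        System.out.println(upperCaseTokens("The quick red fox"));
    }
}
```

Running main prints [THE, QUICK, RED, FOX]. In a real analysis chain the filter would instead wrap an existing TokenStream, as the constructor prototype above indicates.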

Usage

From source file: edu.upenn.library.solrplugins.tokentype.TokenTypeProcessFilterTest.java

License: Apache License

public void test() throws IOException {
    String test = "The quick red fox jumped over the lazy brown dogs";

    // Collections.emptySet() avoids the raw-type warning that Collections.EMPTY_SET triggers.
    TokenTypeProcessFilter ttpf = new TokenTypeProcessFilter(new Blah(whitespaceMockTokenizer(test)),
            Collections.singleton("even"), Collections.emptySet(), "even_processed");
    ttpf.setDelegate(new UpperCaseFilter(ttpf));
    int count = 0;
    TypeAttribute typeAtt = ttpf.getAttribute(TypeAttribute.class);
    OffsetAttribute offsetAtt = ttpf.getAttribute(OffsetAttribute.class);
    PositionIncrementAttribute posIncrAtt = ttpf.getAttribute(PositionIncrementAttribute.class);
    CharTermAttribute termAtt = ttpf.getAttribute(CharTermAttribute.class);
    int lastEndOffset = -1;
    ttpf.reset();
    while (ttpf.incrementToken()) {
        String term = termAtt.toString();
        String type = typeAtt.type();
        int startOffset = offsetAtt.startOffset();
        int endOffset = offsetAtt.endOffset();
        int posIncr = posIncrAtt.getPositionIncrement();
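        // Every token advances the position by exactly one.
        assertEquals(1, posIncr);
        // Tokens are separated by a single space, so each starts one past the previous end offset.
        assertEquals(lastEndOffset + 1, startOffset);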
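        switch (count % 2) {
        case 0:
            // Even-indexed tokens went through the UpperCaseFilter delegate and were retyped.
            assertEquals(term.toUpperCase(), term);
            assertEquals("even_processed", type);
            break;
        case 1:
            // Odd-indexed tokens bypass the delegate and keep their original case.
            assertFalse(term.equals(term.toUpperCase()));
            break;
        }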
        lastEndOffset = endOffset;
        count++;
    }
    // The test sentence contains 10 whitespace-separated tokens.
    assertEquals(10, count);

}

From source file: org.elasticsearch.analysis.common.UpperCaseTokenFilterFactory.java

License: Apache License

@Override
public TokenStream create(TokenStream tokenStream) {
    return new UpperCaseFilter(tokenStream);
}