List of usage examples for org.apache.lucene.analysis CharFilter subclass-usage
From source file com.github.cstoku.neologd.unidic.lucene.analysis.ja.JapaneseIterationMarkCharFilter.java
/**
* Normalizes Japanese horizontal iteration marks (odoriji) to their expanded form.
* <p>
* Sequences of iteration marks are supported. In case an illegal sequence of iteration
* marks is encountered, the implementation emits the illegal source character as-is
* without considering its script. For example, with input "??", we get
From source file com.zimbra.cs.index.analysis.NormalizeTokenFilter.java
/**
* Normalizes token text:
* <ul>
* <li>full-width alnum to half-width alnum
* <li>half-width katakana to full-width katakana
* <li>accented character to ASCII equivalent ({@link org.apache.lucene.analysis.ASCIIFoldingFilter})