Example usage for Java org.apache.lucene.analysis.miscellaneous WordDelimiterGraphFilter fields, constructors, methods, implement or subclass
The text is from its open source code.
int | GENERATE_WORD_PARTS Causes parts of words to be generated: "PowerShot" => "Power" "Shot" |
int | GENERATE_NUMBER_PARTS Causes number subwords to be generated: "500-42" => "500" "42" |
int | CATENATE_WORDS Causes maximum runs of word parts to be catenated: "wi-fi" => "wifi" |
int | CATENATE_NUMBERS Causes maximum runs of number parts to be catenated: "500-42" => "50042" |
int | CATENATE_ALL Causes all subword parts to be catenated: "wi-fi-4000" => "wifi4000" |
int | PRESERVE_ORIGINAL Causes original words are preserved and added to the subword list (Defaults to false) "500-42" => "500" "42" "500-42" |
int | SPLIT_ON_CASE_CHANGE Causes lowercase -> uppercase transition to start a new subword. |
int | SPLIT_ON_NUMERICS If not set, causes numeric changes to be ignored (subwords will only be generated given SUBWORD_DELIM tokens). |
int | STEM_ENGLISH_POSSESSIVE Causes trailing "'s" to be removed for each subword "O'Neil's" => "O", "Neil" |
WordDelimiterGraphFilter(TokenStream in, int configurationFlags, CharArraySet protWords) Creates a new WordDelimiterGraphFilter using WordDelimiterIterator#DEFAULT_WORD_DELIM_TABLE as its charTypeTable |