Java org.apache.lucene.analysis.miscellaneous WordDelimiterFilter fields, constructors, methods, implement or subclass

Example usage for Java org.apache.lucene.analysis.miscellaneous WordDelimiterFilter fields, constructors, methods, implement or subclass

Introduction

In this page you can find the methods, fields and constructors for org.apache.lucene.analysis.miscellaneous WordDelimiterFilter.

The text is from its open source code.

Field

intLOWER
intUPPER
intDIGIT
intSUBWORD_DELIM
intALPHA
intALPHANUM
intGENERATE_WORD_PARTS
Causes parts of words to be generated:

"PowerShot" => "Power" "Shot"

intGENERATE_NUMBER_PARTS
Causes number subwords to be generated:

"500-42" => "500" "42"

intCATENATE_WORDS
Causes maximum runs of word parts to be catenated:

"wi-fi" => "wifi"

intPRESERVE_ORIGINAL
Causes original words are preserved and added to the subword list (Defaults to false)

"500-42" => "500" "42" "500-42"

intSPLIT_ON_CASE_CHANGE
If not set, causes case changes to be ignored (subwords will only be generated given SUBWORD_DELIM tokens)
intSPLIT_ON_NUMERICS
If not set, causes numeric changes to be ignored (subwords will only be generated given SUBWORD_DELIM tokens).
intSTEM_ENGLISH_POSSESSIVE
Causes trailing "'s" to be removed for each subword

"O'Neil's" => "O", "Neil"

Constructor

WordDelimiterFilter(TokenStream in, int configurationFlags, CharArraySet protWords)
Creates a new WordDelimiterFilter using WordDelimiterIterator#DEFAULT_WORD_DELIM_TABLE as its charTypeTable
WordDelimiterFilter(TokenStream in, byte[] charTypeTable, int configurationFlags, CharArraySet protWords)
Creates a new WordDelimiterFilter