I'm looking for feedback on which analyzer to use with an index that has documents from multiple languages. Currently I am using the SimpleAnalyzer, as it seems to handle the broadest ...
I am using Lucene (or more specifically Compass) to log threads in a forum, and I need a way to extract the keywords behind the discussion. That said, I don't want ...
With Lucene 2.9.1, Field.Index.TOKENIZED is deprecated. The documentation says it has just been renamed to ANALYZED, but I don't think the meaning has stayed the same. I have an existing ...
Hi, I am building a search application using Lucene. Some of my queries are complex. For example, my documents contain the fields location and population, where location is a not-analyzed field ...
I know what NOT_ANALYZED means. In short, the field will not be tokenized by the specified Analyzer.
However, what does NO_NORMS mean? I have read the documentation, but please explain ...
How do I add a Compass analyzer for indexing and searching data in Compass? I am using schema-based configuration for Compass. I want to use StandardAnalyzer with no stopwords, because I want to index ...
Using StandardAnalyzer, my Lucene index contains numeric terms too (e.g. "200"), so the number of terms in my index is too large.
Does anyone know if there is an Analyzer or Tokenizer that discards ...