filter « document « Java Lucene Q&A

Home
Java Lucene Q&A
1.Database
2.Development
3.document
4.Field
5.index
6.lucene
7.lucene.net
8.nutch
9.query
10.solr
11.Tools
Java Lucene Q&A » document » filter 

1. Lucene: Filtering for documents NOT containing a Term    stackoverflow.com

I have an index whose documents have two fields (actually more like 800 fields but the other fields won't concern us here):

  • The contents field contains the analyzed/tokenized text of the document. ...

2. How do I determine if a document is an article?    stackoverflow.com

Say I've X documents what algorithm/library/tika config/nekohtml filter would tell me which of those is an "article" and which is not, and for those that are give me the article text ...

3. How to filter results in Solr MLTHandler to exclude documents containing blacklisted words?    stackoverflow.com

I have a solr index which contains millions of textual documents, all submitted by users. Quite a lot of these documents are potential spam. In my webapp, I show a "related ...

java2s.com  | Contact Us | Privacy Policy
Copyright 2009 - 12 Demo Source and Support. All rights reserved.
All other trademarks are property of their respective owners.