I was looking to get people's thoughts on keeping a Lucene index up to date as changes are made to the domain model objects of an application.
The application in question ... |
Is there a known math formula that I can use to estimate the size of a new Lucene index? I know how many fields I want to have indexed, and the ... |
I need to index bi-grams of words (tokens) in Lucene. I can produce n-grams and then index them, but I am wondering if there is something in Lucene which will do ... |
In my datasource there are a lot of special characters like forward slash, minus, plus, etc. A lot of these characters cause problems for Lucene.
That's why I decided to encode ... |
I am thinking of adding a JMX bean for taking a hot backup of a Lucene index.
LuceneMBean mbean = new LuceneMBeanImpl();
ObjectName name = new ObjectName("indexing.index:type=lucene");
MBeanServer mbs = ManagementFactory.getPlatformMBeanServer();
... |
I'm starting from a Lucene index which someone else created. I'd like to find all of the words that follow a given word. I've extracted the term (org.apache.lucene.index.Term) of ... |
I'm trying to come up with a way to query dates in Lucene. Basically I have an event that has a start date, end date and can also occur regularly. The ... |
|
Is it possible to compare data from multiple Lucene indexes? I would like to get documents that have the same value in similar fields (like first name, last name) across two ... |
I'm working on a project for which I want to build a tag cloud by reading a Lucene index and pruning it down. I didn't set up the Lucene engine, it ... |
I had an issue with my search not returning the results I expected.
I tried to run Luke on my index, but it said it was locked and I needed to Force ... |
I have a StackOverflow-like system where content is organised into threads, each thread having content of its own (the question body / text), and posts / replies.
I'm producing the ability to ... |
I have some BigDecimal values which should be indexed for searching. Lucene has NumericField but it has setters only for long, double, float and int. I could store it as a ... |
What is the best way to split an existing Lucene index into two halves, i.e. each split should contain half of the total number of documents in the original index?
|
I'm trying to figure out the right way to read a Lucene index only once while running the application multiple times. How can I do that in Java?
Because indexed data will ... |
What is the best way to achieve Lucene real-time indexing?
|
I am looking for a keyword indexing library for Java. I found Lucene through a Google search. I think it is a very popular one, but I am just wondering if it is the ... |
I am using Lucene to index my documents. In my case, each document is rather small, but there is a large quantity of them (~2GB). And in each document, there are many ... |
I have just started learning Lucene and would like to use it for indexing a table in an existing database. The way I have been thinking about this so far has ... |
Does anyone know of any publicly available question answering applications which have been built using Lucene on TREC data?
Thanks
|
I am storing n-grams up to level 3 in a Lucene index. When I read the index and calculate the scores of terms and n-grams, I obtain results like this:
TERM ...
|
I want to index a "compound word" like "New York" as a single term in Lucene, not as "new", "york", in such a way that if someone searches for "new place", ... |
I'd like to speed up the indexing of 10GB of data into a Lucene index. Would TPL be a good way to do this? Would I need to divide the ... |
I haven't found the answer to my problem, so I decided to post my question to get some help.
I use Lucene to index the objects in computer memory (they exist only in ... |
I am using Lucandra and want to drop an entire index. The IndexReader and IndexWriters don't have all methods implemented so even iterating through a call to deleteDocument(int docNum) isn't possible. ... |
I would like to ask about Lucene indexes. I created a simple
program that creates Lucene indexes and stores them in a folder. I also
used a diagnostic tool named ... |
What is the best way to store the object relationships in lucene? I need to be able to read them back easily and fast. Currently they are not stored in the ... |
I want to read every single index. I want to read and print to the console each single term in my index. (I don't want to view the contents with Luke.) I must use the ... |
Is it OK to create a term for each number in a text?
Example text:
I got 2295910 unique terms.
The numbers can be timestamps, port numbers, anything. The unique numbers lead to a ... |
I'm developing a user interface for a Katta index. For that, I need to find the indexed fields' names (and, if possible, their data types and how they were indexed, either normal ... |
How do I get the total number of documents in a Katta index?
--Thanks in advance
|
System properties:
Java 1.6
Mac OS X version 10
Ant 1.8
Scenario: I am working on my final year project which is to create a website which uses Lucene to search my website and my ... |
I'm new to Lucene. If I use
description = new TermQuery(new Term("description", "my string"));
I am asking Lucene to treat "my string" as a single word, right?
I actually need to consider each word, ... |
We have indexed our entire source code base using Lucene and have a stand-alone application to display matches. Is there a way to integrate the index into Visual Studio so we can ... |
I am using Lucene 3.0.3. In preparation to using SpanQuery and PhraseQuery, I would like to mark paragraph boundaries in my index in a way that will discourage these queries from ... |
I want to know exactly which class in Lucene builds the inverted index.
Thanks
|
What are indexes in Lucene and how do they work?
I have gone through some articles on the net and Google, but I could not understand the concept of the index, documents etc ... |
What is the best practice to backup a lucene index without taking the index offline (hot backup)?
|
Hi
We are looking for a searching mechanism in our application that contains a lot of relational tables containing CLOB content. We have requirements to allow our users to search ... |
In Lucene, what is the relationship between "attribute" and "document index"? What does the index look like in terms of its implementation?
|
I'm developing a context discovery system, which is a mix of searching and suggestions.
Currently I'm looking for a library for indexing.
After some investigation I settled on Lucene and Terrier ... |
The story is this. I want to mimic the behavior of a relational database using a Lucene index in Java. I need to be able to do searching (reading) and writing at ... |
I have data like
1 2 3 4 5 6 7 8 9 10 12 13 14 15 16 17 18 19 20 22 23 24 25 26 28 30 36 ...
|
Are the following lines of code acceptable for taking a hot backup of a Lucene index, or should the IndexWriter/SnapshotDeletionPolicy approach described in "Lucene index backup" be followed?
Directory dir = ...;
IndexReader ...
|
I'm working on a small Lucene project, where I have to index a bunch of text files. So far I've managed to create the index, I think. The code runs and ... |
I can't understand the difference between two ways of indexing: ANALYZED and ANALYZED_NO_NORMS. I didn't understand it from the Lucene javadoc.
Can anyone tell me more about what norms are? What benefits or ... |
Is Lucene capable of indexing 500M text documents of 50K each?
What performance can be expected from such an index, for a single-term search and for a 10-term search?
Should I be worried and directly ... |
When describing a system with a three-tier architecture, which tier would Lucene come under: the Business Logic or the Data Tier?
|
First, I have to say that I am completely new to Lucene, and therefore my question may be a little bit stupid.
What I would like to do is to create ... |
Any idea which hash algorithm is used when indexing each word in Lucene?
|
I want to index two different OntModel instances and execute SPARQL queries on them. For example:
String query = "PREFIX fise: <http://fise.iks-project.eu/ontology/>\n"
...
|
I'm currently investigating the options to extract person names, locations, tech words and categories from text (a lot of articles from the web), which will then be fed into a Lucene/ElasticSearch index. The ... |
We want to use Lucene in our J2EE web application. We want to create a separate Lucene service (which will be deployed on a separate JBoss server) for Lucene-related functionality (like index writing and searching documents).
We will ... |
Is it possible to use Lucene Benchmark to index a Wikipedia dump? I want to be able to execute phrase queries on the latest English Wikipedia page dump. I'm trying to ... |
Say I have two indexes A, B where they have fields A.ID A.Blah B.ID B.Foo where A.ID could equal B.ID. I guess I could use two IndexSearches and manually loop through ... |
I am trying to compare 2 indexes of a string. Basically if the first index of a string is not equal to the second index, create new searchIndex() method in void ... |
We are trying to improve Lucene index writing by writing to an SSD drive (100 GB OCZ RevoDrive X2), but we are getting even slower results. Below are some results.
Index Writing on ... |
Do we have to set the MergePolicy on the IndexWriter every time an index is updated (incrementally), or is it fine if the MergePolicy is set on the IndexWriter only the first time ... |
From what I understand, the demo IndexFiles example in the Lucene contributions directory will create an inverted index from document terms to the corresponding document pathnames.
I was wondering if there was ... |
I want to index histogram information from images with Lucene.
I need to know what Lucene class I should modify to tell Lucene that it shouldn't read file contents (clearly there wouldn't ... |
Hi all. I'm having a problem with my search engine, which is built on Lucene. It's made of a crawler part and a search part. When the engine starts, it loads the index and the search part is available. Then I start the crawl service, which recreates the index. At the end, I recreate the IndexReader object. But it seems it ... |
Hi all, I'm developing a search tool for a web site using Lucene. One issue I need you to clarify is where we should store the generated Lucene indexes. Lucene provides a file system storage method, FSDirectory, so we need to specify a path like (/usr/lucene on Linux) or (c:/lucene on Windows). I'm wondering how we can store the indexes in remote ... |
|