-
At some point in the past, we started creating a snapshots of KEYS (taken from the auto-generated data from id.apache.org) in the release dir of each release...
http://www.apache.org/dist/lucene/solr…
-
This patch is based on Nutch-308.
This patch adds support for a maximum search time limit. After this time is exceeded, the search thread is stopped, partial results (if any) are returned and the to…
-
SOLR has two classes HTMLStripReader and WordDelimiterFilter which are very useful for a wide variety of use cases. It would be good to place them into core Lucene.
---
Migrated from [LUCENE-1377]…
-
#3556 added an IndexSorter to 3.x, but we need to port this
functionality to 4.0 apis.
---
Migrated from [LUCENE-3918](https://issues.apache.org/jira/browse/LUCENE-3918) by Robert Muir (@rmuir), 2 …
-
Due to the variouse tokenStream APIs we had in lucene analyzer subclasses need to implement at least one of the methodes returning a tokenStream. When you look at the code it appears to be almost iden…
-
Today, the use of addIndexes and addIndexesNoOptimize is confusing -
especially on when to invoke each. Also, addIndexes calls optimize() in
the beginning, but only on the target index. It also incl…
-
Attached is a patch for an AutomatonQuery/Filter (name can change if its not suitable).
Whereas the out-of-box contrib RegexQuery is nice, I have some very large indexes (100M+ unique tokens) where q…
-
The current Maven POM templates only contain dependency information, the bare bones necessary for uploading artifacts to the Maven repository.
The full Maven POMs in the attached patch include the in…
-
This issue is a result of a recent discussion we've had on the mailing list. You can read the thread [here](http://www.nabble.com/Is-TopDocCollector%27s-collect()-implementation-correct--td22557419.ht…
-
There are various places in Lucene that could take advantage of an
efficient packed unsigned int/long impl. EG the terms dict index in
the standard codec in #2532 could subsantially reduce it's RAM
u…