apache-nutch Search Results

353 results
for apache-nutch

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

apache/lucene #6207

rm or formalize dealing with "general" KEYS files in our dis…

At some point in the past, we started creating a snapshots of KEYS (taken from the auto-generated data from id.apache.org) in the release dir of each release... http://www.apache.org/dist/lucene/solr…

asfimport updated 2 years ago
40
apache/lucene #2073

Add search timeout support to Lucene [LUCENE-997]

This patch is based on Nutch-308. This patch adds support for a maximum search time limit. After this time is exceeded, the search thread is stopped, partial results (if any) are returned and the to…

asfimport updated 2 years ago
54
apache/lucene #2451

Add HTMLStripReader and WordDelimiterFilter from SOLR [LUCEN…

SOLR has two classes HTMLStripReader and WordDelimiterFilter which are very useful for a wide variety of use cases. It would be good to place them into core Lucene. --- Migrated from [LUCENE-1377]…

asfimport updated 2 years ago
35
apache/lucene #4991

Port index sorter to trunk APIs [LUCENE-3918]

#3556 added an IndexSorter to 3.x, but we need to port this functionality to 4.0 apis. --- Migrated from [LUCENE-3918](https://issues.apache.org/jira/browse/LUCENE-3918) by Robert Muir (@rmuir), 2 …

asfimport updated 2 years ago
62
apache/lucene #3109

Massive Code Duplication in Contrib Analyzers - unifly the a…

Due to the variouse tokenStream APIs we had in lucene analyzer subclasses need to implement at least one of the methodes returning a tokenStream. When you look at the code it appears to be almost iden…

asfimport updated 2 years ago
51
apache/lucene #3529

Some house cleaning in addIndexes* [LUCENE-2455]

Today, the use of addIndexes and addIndexesNoOptimize is confusing - especially on when to invoke each. Also, addIndexes calls optimize() in the beginning, but only on the target index. It also incl…

asfimport updated 2 years ago
62
apache/lucene #2680

Automaton Query/Filter (scalable regex) [LUCENE-1606]

Attached is a patch for an AutomatonQuery/Filter (name can change if its not suitable). Whereas the out-of-box contrib RegexQuery is nice, I have some very large indexes (100M+ unique tokens) where q…

asfimport updated 2 years ago
224
apache/lucene #3731

Replace Maven POM templates with full POMs, and change docum…

The current Maven POM templates only contain dependency information, the bare bones necessary for uploading artifacts to the Maven repository. The full Maven POMs in the attached patch include the in…

asfimport updated 2 years ago
82
apache/lucene #2649

Refactoring Lucene collectors (HitCollector and extensions) …

This issue is a result of a recent discussion we've had on the mailing list. You can read the thread [here](http://www.nabble.com/Is-TopDocCollector%27s-collect()-implementation-correct--td22557419.ht…

asfimport updated 2 years ago
130
apache/lucene #3065

Add unsigned packed int impls in oal.util [LUCENE-1990]

There are various places in Lucene that could take advantage of an efficient packed unsigned int/long impl. EG the terms dict index in the standard codec in #2532 could subsantially reduce it's RAM u…

asfimport updated 2 years ago
73

上一页 1...11 12 13 14 15 16 17...36 下一页

353 results for apache-nutch

353 results
for apache-nutch