-
huge documents can be drastically slower than need be because the entire field is added to the memory index
this cost can be greatly reduced in many cases if we try and respect maxDocCharsToAnalyze
t…
-
As of Lucene 3.1, StandardTokenizer implements UAX#29 word boundary rules to provide language-neutral tokenization. Lucene contains several language-specific tokenizers that should be replaced by UAX…
-
Per discussion at Committer's Call 2/19/22.
## Context
Blacklight currently (and for a long history) turns any error from Rsolr connection into a redisplay of search page with the message "So…
-
sorting can be much slower on trunk than branch_3x
---
Migrated from [LUCENE-2504](https://issues.apache.org/jira/browse/LUCENE-2504) by Yonik Seeley (@yonik), resolved Sep 18 2012
Attachments: [LU…
-
Today, applications have to open/close an IndexWriter and open/close an
IndexReader directly or indirectly (via IndexModifier) in order to handle a
mix of inserts and deletes. This performs well when …
-
This issue proposes some refactoring to the benchmark package. Today, DocMaker has two roles: collecting documents from a collection and preparing a Document object. These two should actually be split…
-
To maintain serialization compatibility between Lucene versions, serialVersionUID needs to be added to classes that implement java.io.Serializable. java.io.Externalizable may be implemented in classe…
-
Hi all,
in version 2 of the Norconex Solr Commiter there is a configuration "sourceContenField". This field allow us, to use a metadata field for document content. In Version 3 this field doesn't see…
-
I would like to explore contributing a geo3d package to Lucene. This can be used in conjunction with Lucene search, both for generating geohashes (via spatial4j) for complex geographic shapes, as wel…
-
The ISOLatin1AccentFilter is removing accents from accented characters in the ISO Latin 1 character set.
It does what it does and there is no bug with it.
It would be nicer, though, if there was a mo…