daevaorn / djapian

High level Xapian integration for Django

Per-indexer permanent stemmer and stopper #109

Closed: GoogleCodeExporter closed this issue 9 years ago

GoogleCodeExporter commented 9 years ago
I would like to have a way to provide my custom stemmer and stopper when
registering an indexer for my models.

First of all, a per-indexer stemmer would avoid creating a new
Xapian.Stem(lang) for every row, as the current code does (a small
performance gain).

It would also allow using a custom stemmer (e.g. one based on
ispell/aspell/myspell/hunspell) instead of the algorithmic
Snowball-based stemmer (Xapian.Stem).

Issue #72 would be merged into this one.

Original issue reported on code.google.com by esizi...@gmail.com on 20 Mar 2010 at 6:41

GoogleCodeExporter commented 9 years ago
In r358, a `get_stemmer()` method was introduced on the Indexer class.
Users can now override it to supply custom stemmers once Xapian supports them,
or to return a permanent per-indexer stemmer object (for example, with memoization).

The open question is whether we should support memoization by default.
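
For illustration, a memoizing override might look roughly like the sketch below. It is only a sketch under assumptions: `BaseIndexer` is a hypothetical stand-in for djapian's Indexer, the `get_stemmer(instance=None)` and `get_stemming_lang(instance=None)` signatures are guesses and may differ from r358, and a small fake stemmer class is used when the Xapian Python bindings are not installed.

```python
try:
    import xapian
    make_stem = xapian.Stem
except ImportError:
    class _FakeStem:
        """Stand-in so the sketch runs without the Xapian bindings."""
        def __init__(self, lang):
            self.lang = lang
    make_stem = _FakeStem


class BaseIndexer:
    """Hypothetical stand-in for djapian's Indexer (signatures assumed)."""

    def get_stemming_lang(self, instance=None):
        return "en"

    def get_stemmer(self, instance=None):
        # Non-memoized behaviour: a new stemmer object on every call.
        return make_stem(self.get_stemming_lang(instance))


class MemoizedStemmerIndexer(BaseIndexer):
    """Keeps one stemmer per language for the lifetime of the indexer."""

    def __init__(self):
        self._stemmers = {}

    def get_stemmer(self, instance=None):
        lang = self.get_stemming_lang(instance)
        if lang not in self._stemmers:
            # Create the stemmer once, then reuse it for later rows.
            self._stemmers[lang] = make_stem(lang)
        return self._stemmers[lang]
```

Caching per language rather than a single object keeps the override correct if the stemming language depends on the indexed instance.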

Original comment by esizi...@gmail.com on 15 Jun 2010 at 8:25

GoogleCodeExporter commented 9 years ago
There is a ticket about custom stemmers in the Xapian Trac,
http://trac.xapian.org/ticket/448, which seems to have been fixed recently.

Original comment by esizi...@gmail.com on 21 Jun 2010 at 3:13

GoogleCodeExporter commented 9 years ago
So we can close it.

Original comment by daevaorn on 21 Jun 2010 at 8:59