faustedition / faust-web

Web frontend for the edition of Goethe's Faust
http://faustedition.net/

Cannot search for umlaut + wildcard in meta texts #564

Open thvitt opened 5 years ago

thvitt commented 5 years ago

röntgen* → 0 hits; rontgen* → 2 hits ('Röntgenfloureszenzanalyse')

thvitt commented 5 years ago

Röntgenflouresnzenzanalyse → 1 hit

So it's the combination of wildcard and umlaut.
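Since `rontgen*` finds the hits that `röntgen*` misses, one possible stopgap is to fold umlauts in wildcard terms before the query reaches the index. The following is only a sketch of that idea, not a fix for the underlying cause: the function name `fold_wildcard_terms` is hypothetical, only `ö` is confirmed by the searches above, and the assumption that `ä` and `ü` behave the same way is untested.

```python
import re
import unicodedata

# Assumption: only "ö" was observed in this issue; "ä" and "ü" are
# assumed to behave the same way and are folded too.
_UMLAUT_FOLD = str.maketrans(
    {"ä": "a", "ö": "o", "ü": "u", "Ä": "A", "Ö": "O", "Ü": "U"}
)


def fold_wildcard_terms(query: str) -> str:
    """Fold umlauts only in whitespace-separated terms containing * or ?."""
    # Normalize to NFC so a decomposed "o" + combining diaeresis is also caught.
    query = unicodedata.normalize("NFC", query)

    def fold(match: re.Match) -> str:
        term = match.group(0)
        if "*" in term or "?" in term:
            return term.translate(_UMLAUT_FOLD)
        # Terms without wildcards already work, so leave them untouched.
        return term

    return re.sub(r"\S+", fold, query)
```

For example, `fold_wildcard_terms("röntgen*")` returns `"rontgen*"`, while a query without a wildcard is passed through unchanged. Whether the folded form would still match in an index built with a different analyzer is not covered by this sketch.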

thvitt commented 5 years ago

The problem also occurs in the text when using the default (GermanAnalyzer) and the whitespace (WhitespaceAnalyzer) index, but not using the text (StandardAnalyzer) index.

Probably:

  1. Check whether this is still present with the newest eXist.
  2. If so, check and adjust the Lucene configuration.

thvitt commented 5 years ago

https://github.com/eXist-db/exist/issues/2781

Maybe (temporarily?) switch the defaults / the index for the meta texts to something using the StandardAnalyzer. We'll lose German stemming, though.
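
A switch like this would probably be made in the eXist `collection.xconf` for the affected collection. The fragment below is a rough sketch of what that could look like, not the project's actual configuration: the `tei:p` qname and the `de` analyzer id are placeholders chosen for illustration.

```xml
<!-- Sketch only: qname and analyzer id are placeholders, not taken from faust-web -->
<collection xmlns="http://exist-db.org/collection-config/1.0">
  <index xmlns:tei="http://www.tei-c.org/ns/1.0">
    <lucene>
      <!-- Default analyzer for every <text> entry that does not name one -->
      <analyzer class="org.apache.lucene.analysis.standard.StandardAnalyzer"/>
      <!-- GermanAnalyzer kept under an id so stemming can be re-enabled per entry later -->
      <analyzer id="de" class="org.apache.lucene.analysis.de.GermanAnalyzer"/>
      <text qname="tei:p"/>
    </lucene>
  </index>
</collection>
```

The collection would need to be reindexed after such a change before searches reflect the new analyzer.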