Open thvitt opened 5 years ago
@wolfgangmm I assume you have experience on this?
reproducible on RC8 via docker, however I see 5 test failures out of 9 tests. @thvitt thanks for making this easy to reproduce by adding tests
<testsuites>
<testsuite package="http://www.faustedition.net/exist/test" timestamp="2019-06-10T10:12:43.813Z" tests="9" failures="5" errors="0" pending="0" time="PT0.188S">
<testcase name="testDefault" class="t:testDefault"/>
<testcase name="testDefaultASCII" class="t:testDefaultASCII"/>
<testcase name="testDefaultTrunc" class="t:testDefaultTrunc"/>
<testcase name="testGerman" class="t:testGerman">
<failure message="assertExists failed." type="failure-error-code-1"/>
<output/>
</testcase>
<testcase name="testGermanASCII" class="t:testGermanASCII"/>
<testcase name="testGermanTrunc" class="t:testGermanTrunc">
<failure message="assertExists failed." type="failure-error-code-1"/>
<output/>
</testcase>
<testcase name="testWhitespace" class="t:testWhitespace">
<failure message="assertExists failed." type="failure-error-code-1"/>
<output/>
</testcase>
<testcase name="testWhitespaceASCII" class="t:testWhitespaceASCII">
<failure message="assertExists failed." type="failure-error-code-1"/>
<output/>
</testcase>
<testcase name="testWhitespaceTrunc" class="t:testWhitespaceTrunc">
<failure message="assertExists failed." type="failure-error-code-1"/>
<output/>
</testcase>
</testsuite>
</testsuites>
What is the problem
When configuring eXist to use Lucene’s GermanAnalyzer or a WhitespaceAnalyzer for the full-text search, search terms containing both umlauts and truncation like
Röntgen*
to findRöntgenbilder
yields no results. With the WhitespaceAnalyser, truncation generally doesn’t seem to lead to results. Using the StandardAnalyzer, everything works as expected.What did you expect
Röntgen*
to findRöntgenbilder
regardless of the analyzer.Describe how to reproduce or add a test
3/9 tests fail for me, see comments below:
I’ve run the tests on an otherwise clean eXist 5.0-RC7. The problem also exists on eXist 4.4.0.
Context information