sfu-dhil / wilde

eXist/XQuery app for detecting copying in a collection of XHTML documents.
GNU General Public License v3.0
2 stars 9 forks source link

Searching and accents #31

Closed ubermichael closed 4 years ago

ubermichael commented 4 years ago

Describe the bug The search system treats accented characters differently from unaccented characters.

To Reproduce Steps to reproduce the behavior:

  1. Go to https://dhil.lib.sfu.ca/wilde/search.html
  2. Search for siecle
  3. There are two results

Expected behavior There should be about 30 results, as there are for searching for siècle.

ubermichael commented 4 years ago

A configuration change in the data has fixed it.

          <lucene diacritics="no">
            <text qname="html:p"/>
        </lucene>