localwiki / localwiki-backend-server

Primary LocalWiki backend server environment
GNU General Public License v2.0
48 stars 16 forks source link

Search should remove accents #13

Open philipn opened 9 years ago

philipn commented 9 years ago

From @philipn on March 3, 2014 4:9

Report from Gene:

Searching needs to ignore accents as well as case. A page like "César E. Chávez Branch Library" (http://oaklandwiki.org/C%C3%A9sar_E._Ch%C3%A1vez_Branch_Library) should match "cesar" `as well as "césar". (It does if "Cesar" is used in the article, but that isn't always done, and ends up much lower in the search results than if it had matched the title.)

Accents are inconsistently used even in 'official' documents and by people who speak the languages in question. In at least one case (French in Canada vs. French in France) this ambiguity is built into the official rules. One includes the accents if the word is capitalized (e.g., ARRÊT) and the other does not (e.g., ARRET).

Copied from original issue: localwiki/localwiki#691