hcts-hra / ziziphus


[vocabularies]: allow one or two character search for non-latin alphabets #394

Open MatthiasArnold opened 8 years ago

MatthiasArnold commented 8 years ago

The three-letter minimum for searching within controlled vocabularies does not work for non-Western scripts such as Chinese.

With Latin transliterations there is a workaround: for example, one can search for the full name "Ai Weiwei" (艾未未) instead of only "Ai". This is not possible when one searches with Chinese characters.

Searching for "Thailand" in Chinese means searching for "泰国", which has only two characters, so there is no workaround.

Can the search detect whether a query is written in non-Latin characters, or whether its characters fall in certain higher ranges of Unicode code points? We need a way to search for 艾 or 泰国.
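
To illustrate the kind of detection asked for, here is a minimal sketch in Python. It is not taken from the ziziphus code base: the function names, the constants, and the relaxed limit of one character are all assumptions made for illustration. It checks the Unicode character name of each letter and keeps the three-letter rule only when every letter in the query is Latin.

```python
import unicodedata

# Assumed values: 3 is the minimum described in this issue,
# 1 is a hypothetical relaxed limit for non-Latin queries.
LATIN_MIN_LENGTH = 3
NON_LATIN_MIN_LENGTH = 1


def is_latin_or_neutral(ch: str) -> bool:
    """Return True for Latin letters and for non-letters (digits, spaces, punctuation)."""
    if not ch.isalpha():
        return True
    # Unicode names of Latin letters contain "LATIN",
    # e.g. "LATIN SMALL LETTER A"; unnamed characters yield "".
    return "LATIN" in unicodedata.name(ch, "")


def is_searchable(query: str) -> bool:
    """Decide whether a query is long enough to trigger a vocabulary search."""
    q = query.strip()
    if not q:
        return False
    only_latin = all(is_latin_or_neutral(ch) for ch in q)
    required = LATIN_MIN_LENGTH if only_latin else NON_LATIN_MIN_LENGTH
    # len() counts Unicode code points, so one ideograph counts as one character.
    return len(q) >= required
```

With these assumptions, `is_searchable("Ai")` is still rejected, while `is_searchable("艾")` and `is_searchable("泰国")` are accepted. A check against fixed code point ranges, as suggested above, would be an alternative; the name-based test is used here only because it needs no hand-maintained range table.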