Open ktk opened 13 years ago
Hi @ktk, I found that calling transliterate() from the ICU4Refine extension and then urlify() works well for non-ASCII, UTF-8 encoded text.
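To illustrate the transliterate-then-urlify idea outside of OpenRefine, here is a minimal Python sketch. It is not the implementation of OpenRefine's urlify() or of ICU4Refine's transliterate(); the function names `transliterate` and `slugify` and the German mapping table are assumptions made for this example, and a real ICU transliterator covers far more scripts.

```python
import re
import unicodedata

# Hypothetical German-specific mapping (an assumption for this sketch,
# not taken from OpenRefine or ICU4Refine).
GERMAN_MAP = str.maketrans({
    "ä": "ae", "ö": "oe", "ü": "ue",
    "Ä": "Ae", "Ö": "Oe", "Ü": "Ue",
    "ß": "ss",
})


def transliterate(text: str) -> str:
    """Replace umlauts with digraphs, then strip remaining diacritics."""
    # Compose first so that "u" + combining diaeresis becomes a single "ü".
    text = unicodedata.normalize("NFC", text).translate(GERMAN_MAP)
    # Decompose and drop combining marks (e.g. "é" -> "e").
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def slugify(text: str) -> str:
    """Transliterate, drop anything still non-ASCII, and join words with '-'."""
    ascii_text = transliterate(text).encode("ascii", "ignore").decode("ascii")
    return re.sub(r"[^a-z0-9]+", "-", ascii_text.lower()).strip("-")


print(slugify("Zürich Hauptbahnhof"))  # zuerich-hauptbahnhof
print(slugify("Crème brûlée"))         # creme-brulee
```

Doing the transliteration step before the ASCII filter is what keeps the umlauts from being dropped; characters with no mapping and no decomposition (for example CJK text) are still lost in this sketch, which is where a full ICU transform would be needed.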
Thanks @tr3vr, that looks like a good workaround. It could be integrated into OpenRefine.
While using urlify() I noticed that German umlauts simply get trashed. I'm not sure how this should be done properly, but I propose that the handling work a bit more intelligently than that, especially with regard to other languages that use non-ASCII characters.