Closed telzul closed 8 years ago
This is indeed the issue. Recently I had a case with surprising input:
"Jörg Immendor\u0014. Les théâtres de la peinture"
and with the help of stringex it was turned to:
"jorg-immendor\x14-les-theatres-de-la-peinture"
sure. pull request welcome. sorry for slow reply. my inbox is... yeah. that.
any chance of PR #178 being merged and a new gem version release any time soon?
new gem out there now. thanks for reminding me
i have crawled data that for some reason has \u0003 used within their text. It happens that they also use it in their title; as these chars \u0000 - \u001f are not human readable characters, could they be removed from the result?