Closed: ninoseki closed this issue 4 years ago
Hello @ninoseki, I'll take a look at this and see if I can adjust the regular expression to get this to work.
I think I have a solution. This works:
echo "『http://example.com』インコ\u1f99c" | python iocextract.py
http://example.com
A URL that is surrounded by Japanese characters is not parsed correctly.
I'm not sure how to fix it, but I think checking the TLD might work well.
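To make the TLD idea concrete, here is a minimal sketch using only the standard library. It is not how iocextract does it: the function name `extract_urls_with_tld_check`, the pattern `URL_RE`, and the tiny `KNOWN_TLDS` set are all made up for illustration, and a real check would presumably need the full IANA TLD list. The pattern only matches ASCII characters that are allowed in URLs, so a non-ASCII delimiter such as 』 ends the match, and the TLD check then filters out anything whose host does not end in a known TLD.

```python
import re
from urllib.parse import urlsplit

# Hypothetical, deliberately tiny allow-list (assumption for this sketch);
# a real implementation would likely load the full IANA TLD list.
KNOWN_TLDS = {"com", "net", "org", "jp"}

# Match only ASCII characters commonly allowed in URLs, so non-ASCII
# delimiters such as 『 』 terminate the match. Square brackets (IPv6
# literals) are left out on purpose to keep the sketch simple.
URL_RE = re.compile(r"https?://[A-Za-z0-9\-._~:/?#@!$&'()*+,;=%]+")


def extract_urls_with_tld_check(text):
    """Return URLs in text whose host ends in one of KNOWN_TLDS."""
    urls = []
    for match in URL_RE.finditer(text):
        url = match.group(0)
        # hostname is None when no host could be parsed; treat that as empty.
        host = (urlsplit(url).hostname or "").rstrip(".")
        tld = host.rsplit(".", 1)[-1].lower()
        if tld in KNOWN_TLDS:
            urls.append(url)
    return urls


print(extract_urls_with_tld_check("『http://example.com』インコ"))
# ['http://example.com']
```

With this approach the Japanese brackets never become part of the candidate URL, so the TLD check sees `com` rather than `com』インコ`. The trade-off is that internationalized domain names written in non-ASCII form would be missed.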