Closed amoldavsky closed 5 months ago
>>> from urlextract import URLExtract
>>> extractor = URLExtract()
>>> extractor.find_urls("You can also visit my website…IMINIT.MYAMBIT.COM")
['website…IMINIT.MYAMBIT.COM']
>>> extractor.find_urls("some%sIMINIT.MYAMBIT.COM" % chr(8231))
['some‧IMINIT.MYAMBIT.COM']
The characters … (U+2026) and ‧ (U+2027) are not valid URL characters, yet they end up in the result when the extractor expands to the left of the TLD.
Only ASCII characters are allowed to the left of the TLD. This case should be fixed in the next release.
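To illustrate the rule, here is a minimal sketch of a left-hand scan that stops at the first character outside an ASCII allow-list. This is not urlextract's actual code: the names ALLOWED_LEFT, expand_left and candidate are hypothetical, and the allow-list is a simplified assumption based on the characters RFC 3986 permits in a URI.

```python
import string

# Assumption: a simplified allow-list of ASCII characters that may appear
# to the left of the TLD. urlextract's real rules may differ.
ALLOWED_LEFT = set(string.ascii_letters + string.digits + "-._~:/?#[]@!$&'()*+,;=%")


def expand_left(text: str, tld_pos: int) -> int:
    """Return the index where the URL candidate ending at tld_pos starts.

    Walks left from tld_pos and stops at the first character that is not
    in ALLOWED_LEFT (for example U+2026 or U+2027) or at the string start.
    """
    start = tld_pos
    while start > 0 and text[start - 1] in ALLOWED_LEFT:
        start -= 1
    return start


def candidate(text: str, tld: str = ".COM") -> str:
    """Hypothetical helper: cut the candidate URL ending with the last `tld`."""
    tld_pos = text.rfind(tld)
    return text[expand_left(text, tld_pos):tld_pos + len(tld)]


print(candidate("You can also visit my website…IMINIT.MYAMBIT.COM"))
print(candidate("some%sIMINIT.MYAMBIT.COM" % chr(8231)))
```

With this rule, both inputs from the report yield IMINIT.MYAMBIT.COM, because the scan stops at the non-ASCII character instead of swallowing the preceding word.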