lipoja / URLExtract

URLExtract is python class for collecting (extracting) URLs from given text based on locating TLD.
MIT License
241 stars 61 forks source link

should not grab email fragments #122

Closed amoldavsky closed 1 year ago

amoldavsky commented 2 years ago
>>> from urlextract import URLExtract
>>> extractor = URLExtract()
>>> extractor.find_urls("@gmail.com")
['@gmail.com']
>>> extractor.find_urls("bad.email @gmail.com")
['bad.email', '@gmail.com']

should not be grabbing @gmail.com at all

lipoja commented 2 years ago

@amoldavsky Hi, could you help me here? What should be the expected result? ['bad.email', 'gmail.com'] or ['bad.email']