seomoz / url-cpp

C++ bindings for url parsing and sanitization
MIT License
19 stars 11 forks source link

Parser not filtering out bad URLs? #24

Open jowellv opened 7 years ago

jowellv commented 7 years ago

The url parser seems to be treating this a valid URL:

http://ww.monitor.co.ugcom.coremedia.mauritius.cae.contentbeans.mauarticleimpl$$[id=1749038

Perhaps it would be better to filter out URLs of this type.

dlecocq commented 7 years ago

It's something we can (and should) cover. This hasn't historically been a huge concern for us because such malformed domains are unlikely to have a campaign associated with them. But rest assured we will tackle it.