Anonyfox / meteor-scrape

Scrape any Website or RSS/Atom-Feed with ease.
GNU Lesser General Public License v3.0
71 stars 19 forks source link

"url" field broken in edgecases #17

Open Anonyfox opened 9 years ago

Anonyfox commented 9 years ago

when scraping the website http://www.bostonherald.com, the "http:" part gets stripped out. The site has

<link rel="canonical" href="//www.bostonherald.com/" /> in it's HTML, so the "//" symbol must be resolved to "http" by default.