c4software / python-sitemap

Mini website crawler to make sitemap from a website.
GNU General Public License v3.0
362 stars 110 forks source link

Make link regex ignore other attributes #16

Closed Garrett-R closed 8 years ago

Garrett-R commented 8 years ago

Currently, if you have a link such as:

<a class='hello' href='/about'>

then this link is missed. This update the regex ensures these are caught.

c4software commented 8 years ago

Nice catch! Thanks you :+1: