rleith / lylina2

Rewrite of my 'river-of-news' RSS reader lylina

"index.php/" causes relative js/css links to recursively load the page #13

Closed nwatson closed 12 years ago

nwatson commented 12 years ago

Going to http://lylina.com/index.php/ or http://lylina.com/index.php/blah/ causes relative links (such as the JavaScript and CSS) to resolve underneath index.php/, so each one loads the entire page again, recursively. Some search engine bots got stuck in the loop:

66.249.72.165 - - [01/Feb/2012:04:38:28 -0500] "GET /index.php/democrats.oversight.house.gov/images/stories/MINORITY/fcic%20report/en/democrats.oversight.house.gov/images/stories/MINORITY/fcic%20report/www.photopile.me/user/thatdrew/www.guardian.co.uk/business/2011/dec/28/2011/12/29/what-is-it-game-207/kotaku.com/judiciary.house.gov/issues/neatobambino/www.youtube.com/www.youtube.com/mashable.com/category/cache/5987249038ed61e628431e52b91941fd.ico HTTP/1.1" 200 1020656 "-" "Googlebot-Image/1.0"

etc.
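
To illustrate why the trailing slash matters, here is a minimal sketch of standard relative-URL resolution using Python's `urllib.parse.urljoin`. The link `css/style.css` is a hypothetical stand-in for whatever relative paths the page actually uses; only the two base URLs come from the report above.

```python
from urllib.parse import urljoin

# Base URLs from the report; the relative link is a hypothetical example.
GOOD = "http://lylina.com/index.php"
BAD = "http://lylina.com/index.php/"
link = "css/style.css"


def resolve(page_url, relative_link):
    """Resolve a relative link the way a browser or crawler would."""
    return urljoin(page_url, relative_link)


# Without the trailing slash, "index.php" is treated as a file name and
# the link resolves next to it.
print(resolve(GOOD, link))  # http://lylina.com/css/style.css

# With the trailing slash, "index.php/" is treated as a directory, so the
# link resolves underneath it and the request goes back to index.php.
print(resolve(BAD, link))   # http://lylina.com/index.php/css/style.css
```

Because the second URL is again served by index.php, the returned page contains the same relative links, and a crawler following them builds ever longer paths like the one in the log line.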

As a temporary measure, I added a robots.txt that disallows all bots.
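
The actual robots.txt is not shown in this thread; the usual way to disallow all bots is the two-line file embedded below. This sketch assumes that content and checks it with Python's `urllib.robotparser`, using the crawler name from the log line.

```python
from urllib.robotparser import RobotFileParser

# Assumed contents of the temporary robots.txt (not quoted from the site).
ROBOTS_TXT = """\
User-agent: *
Disallow: /
"""


def is_allowed(user_agent, url, robots_txt=ROBOTS_TXT):
    """Return True if robots_txt permits user_agent to fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Every path is blocked for every crawler that honours robots.txt.
print(is_allowed("Googlebot-Image", "http://lylina.com/index.php/"))  # False
```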

rleith commented 12 years ago

Fixed by handling the bad URLs better in commit 3a35655307919e71b86b7234aa3d2acb0d0063cf
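
The commit's contents are not reproduced in this thread, so the following is only a sketch of one common way to handle such URLs, not necessarily what the commit does: detect extra path segments after the script name and redirect to the canonical URL. The function name `canonical_target` and the default script path are hypothetical.

```python
def canonical_target(request_path, script="/index.php"):
    """Return the path to redirect to, or None if no redirect is needed.

    Any request of the form "<script>/<anything>" carries extra path
    segments that make relative links resolve underneath the script, so it
    is sent back to the bare script path.
    """
    if request_path.startswith(script + "/"):
        return script
    return None


print(canonical_target("/index.php/"))       # /index.php
print(canonical_target("/index.php/blah/"))  # /index.php
print(canonical_target("/index.php"))        # None
```

An alternative with the same effect would be to emit absolute paths for the JavaScript and CSS links, so that they resolve the same way regardless of the request URL.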