propublica / Capitol-Words

Scraping, parsing and indexing the daily Congressional Record to support phrase search over time, and by legislator and date
BSD 3-Clause "New" or "Revised" License

specific date urls are throwing 500s #75

Open timball opened 10 years ago

timball commented 10 years ago

From Google Webmaster Tools: a pile of URLs of the form http://capitolwords.org/date/<YEAR>/<MO>/<DAY>/ are throwing 500 errors.

Specific but not exhaustive example: http://capitolwords.org/date/2001/03/22/ fails, but http://capitolwords.org/date/2001/03/ works and even lists the 22nd as a valid date. When you click on the 22nd from that month URL, it falls down go boom.

It doesn't appear to be ALL dates, just "some", and by "some" Google Webmaster Tools claims 157-ish.

As a for instance:
http://capitolwords.org/date/2004/09/13/ <--- works
http://capitolwords.org/date/2004/09/09/ <--- doesn't

--timball
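
One way to narrow down which dates are affected would be to walk a date range and record every per-day URL that returns a 5xx status. The sketch below is only an illustration, not code from this repository: the helper names `date_url`, `fetch_status`, and `failing_dates` are hypothetical, and it assumes the URL pattern quoted in the report above. The fetch function is injectable so the loop can be exercised without hitting the live site.

```python
from datetime import date, timedelta
from urllib.error import HTTPError, URLError
from urllib.request import urlopen

# Assumed base URL, taken from the URL pattern quoted in the report.
BASE = "http://capitolwords.org/date"


def date_url(d, base=BASE):
    """Build the per-day URL, e.g. .../date/2001/03/22/ (zero-padded)."""
    return f"{base}/{d:%Y/%m/%d}/"


def fetch_status(url, timeout=10):
    """Return the HTTP status code for url, or None if the host is unreachable."""
    try:
        with urlopen(url, timeout=timeout) as resp:
            return resp.status
    except HTTPError as err:
        # urlopen raises on 4xx/5xx; the status code is carried on the exception.
        return err.code
    except URLError:
        return None


def failing_dates(start, end, fetch=fetch_status):
    """Return (date, status) pairs for every day in [start, end] that answers 5xx."""
    failures = []
    d = start
    while d <= end:
        status = fetch(date_url(d))
        if status is not None and status >= 500:
            failures.append((d, status))
        d += timedelta(days=1)
    return failures
```

Calling `failing_dates(date(2004, 9, 1), date(2004, 9, 30))` would check every day in September 2004 against the live site. Comparing the resulting list with the source data for those days might show what the failing dates have in common.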

drinks commented 10 years ago

Hum. I need to add Sentry, I guess.