Sometimes this calendar throws a "cannot find content" error (while scraping), but response status is still 200. Right now, it makes 5 attempts and ignores that month if those fail. Could it be the request headers?
I was unsure what the session should be for this scraper – it's currently <YEAR> ENGINEERING. We could choose a session (fall/winter/summer) based on the month, but we may run into some odd cases (this is completely hypothetical):
{
"date": "2016-04-05",
"events": [{
"end_date": "2016-04-05",
"session": "2016 WINTER",
"campus": "UTSG",
"description": "First day for course enrolment for the summer term." // in ArtSci, UTM this would fall under the `2016 SUMMER` session.
}]
}
UTMCalendar
changed toUTMDates
Add
UTSGDates
ArtSciDates
- http://www.artsci.utoronto.ca/current/course/timetable/EngDates
- http://www.undergrad.engineering.utoronto.ca/About/Dates_Deadlines.htm200
. Right now, it makes 5 attempts and ignores that month if those fail. Could it be the request headers?I was unsure what the
session
should be for this scraper – it's currently<YEAR> ENGINEERING
. We could choose a session (fall/winter/summer) based on the month, but we may run into some odd cases (this is completely hypothetical):