cobalt-uoft / uoft-scrapers

Public web scraping scripts for the University of Toronto.
https://pypi.python.org/pypi/uoftscrapers
MIT License
48 stars 14 forks source link

UTSC timetable scraper #7

Open qasim opened 9 years ago

qasim commented 9 years ago

The UTSC timetable needs a scraper.

vatdaell commented 8 years ago

Hey I can work on that

qasim commented 8 years ago

@vatdaell Cool! Here's where the file for it is: uoftscrapers/scrapers/timetable/utsc/__init__.py

And these are the places I found where UTSC posts info. If you're a UTSC student, you may know better.

Keep us updated if you find anything else / want feedback!

vatdaell commented 8 years ago

Hey, I think we might have a problem with the UTSC calendar due to the current formatting of the utsg calendar. Some things do not apply to UTSC courses

arkon commented 8 years ago

@vatdaell Like what?

vatdaell commented 8 years ago

@arkon enrollment numbers and limits are not public as of now. Also I'm not sure what id is used for. Plus are breadths just the breadth that each course belongs into?

qasim commented 8 years ago

@vatdaell You can set enrolment numbers / limits to zero by default in that case.

id is a unique identifier for the course. Basically it's split into the following pieces:

This is unique to every course taught at all 3 campuses. Hopefully the website you're scraping from will give you bits and pieces that you can put together to form the full id. Let me know if you need clarification or help with anything!

Any yeah, breadths are a UTSG thing so UTSC might not apply. You can leave them as empty arrays.

vatdaell commented 8 years ago

Thanks