Fix course info scraper 84

stumash / CoursePlanner

MIT License

2 changes to regexes

First Change

The course.info.header.rgx was: [A-Z]{4} [0-9]{3}[[:space:]]+?[A-Z][a-z]+., but is now: [A-Z]{4} [0-9]{3}[[:space:]]+?(\(also listed as [^)]*\))?[A-Z][a-z]+.

We use this regex to identify the start of a single course's information. Some courses' information starts with something like: COMP 101 (also listed as SOEN 101) Intro. to Programming instead of: COMP 101 Intro. to Programming. The old regex did not match the first form.
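As a rough illustration of the header change, here is a minimal Python sketch that runs both versions against the two sample headers. It is not the scraper's own code: Python's re module has no POSIX classes, so [[:space:]] is written as \s, and the \s* placed after the optional group is an assumption of this sketch (it is not in the regex quoted above), added so that the space after the closing parenthesis in the sample string is consumed. The names OLD_HEADER and NEW_HEADER are made up for the example.

```python
import re

# Python's re has no POSIX bracket classes, so [[:space:]] is written as \s.
OLD_HEADER = re.compile(r"[A-Z]{4} [0-9]{3}\s+?[A-Z][a-z]+.")

# Assumption: the \s* after the optional group is not in the regex quoted in
# the PR text; it is added so the space following ")" in the sample is consumed.
NEW_HEADER = re.compile(
    r"[A-Z]{4} [0-9]{3}\s+?(\(also listed as [^)]*\))?\s*[A-Z][a-z]+."
)

plain = "COMP 101 Intro. to Programming"
cross_listed = "COMP 101 (also listed as SOEN 101) Intro. to Programming"

# Report which header forms each version of the regex recognizes.
for name, rgx in (("old", OLD_HEADER), ("new", NEW_HEADER)):
    for header in (plain, cross_listed):
        print(name, bool(rgx.search(header)), header)
```

In this sketch the old pattern finds only the plain header, while the new one finds both.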

Second Change

The previous regex was essentially broken and was trying to achieve the result of the new one. The new regex matches the entire string up to the first occurrence of Lecture, Tutorial, Laboratory, NOTE, or $ (end of string).
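The PR text does not quote this regex, so the following Python sketch only shows one way to get the described behaviour: a lazy match followed by a lookahead for the stop words or the end of the string. The pattern, the name DESCRIPTION, the helper description_part, and the sample strings are all hypothetical.

```python
import re

# Hypothetical pattern: lazily consume text until the lookahead sees one of
# the section keywords or the end of the string (\Z).
DESCRIPTION = re.compile(r".*?(?=Lecture|Tutorial|Laboratory|NOTE|\Z)", re.DOTALL)


def description_part(course_text: str) -> str:
    """Return the text before the first section keyword, or all of it."""
    # The match cannot fail: \Z is always reachable at the end of the string.
    return DESCRIPTION.match(course_text).group(0)


print(repr(description_part("Covers basics. Lecture: three hours per week.")))
print(repr(description_part("Covers basics.")))
```

The lookahead keeps the stop word itself out of the match, and the \Z alternative covers course text that contains none of the keywords.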

resolves #84

Fix course info scraper 84 #91