CredentialEngine / ai-course-crawler


College of Lake County catalogue #48

Open rvilsack opened 2 months ago

rvilsack commented 2 months ago

The College of Lake County CSV file has the following issues.

AI Course Crawler Extract link: https://master.ai-course-crawler.development.c66.me/datasets/courses/32

Extract file in Google Sheets: https://docs.google.com/spreadsheets/d/1YzbK7n46XhLSJKTGpDmda2yy8GUvkidyo43HQsUZTkI/edit?usp=sharing

EXPECTED: The course URLs look good, with multiple pages of courses that I expected the course crawler to pick up:

[Image: cc2]

ACTUAL: The extract was successful, the template was correct, and it included the minimum data. The issue is that the same courses were replicated 100 times; it looks like the crawler picked up one page of data and just repeated it:

[Image: cc1]
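
For anyone triaging this, one quick way to confirm the repetition is to count how many times each course appears in the extract. This is only a minimal sketch: the column names `course_id` and `course_name` and the sample rows are made up for illustration, and the real extract headers may differ.

```python
import csv
import io
from collections import Counter


def duplicate_summary(csv_text, key_columns):
    """Count how often each course key appears in a CSV extract.

    key_columns are hypothetical; use whichever columns identify a
    course in the actual extract.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    counts = Counter(tuple(row[col] for col in key_columns) for row in reader)
    return {
        "total_rows": sum(counts.values()),
        "unique_rows": len(counts),
        "max_repeats": max(counts.values(), default=0),
    }


# Made-up sample: one "page" of two courses repeated three times.
sample = "course_id,course_name\n" + (
    "ABC 101,Sample Course A\nABC 102,Sample Course B\n" * 3
)

summary = duplicate_summary(sample, ["course_id", "course_name"])
print(summary)
```

If `total_rows` is a large multiple of `unique_rows`, that is consistent with a single page being repeated rather than the later pages being crawled.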

rsaksida commented 2 months ago

CLC is a similar problem to #51, but I don't think they're actually using CourseDog. It's a different system, maybe something they built in-house. It's currently not supported by our system, although we can build support for it if necessary.