CredentialEngine / ai-course-crawler

Apache License 2.0
1 stars 0 forks source link

First Round Testing of Acalog - Inconsistent Descriptions & Credit Min/Max in Extract #55

Open rvilsack opened 1 month ago

rvilsack commented 1 month ago

I'm testing Acalog catalogs.

University of Pittsburgh

Crawler extract: URL: Link to output file: https://docs.google.com/spreadsheets/d/18u8RpLDRkNDEC4SQoc9ZFJHWezXaUwo4vazMDY9PZNY/edit?usp=sharing Number of courses look good Data looks good ISSUE: 1) When there is no description, there is inconsistency in what the extract shows; 2) course desciptions include min + max credit unit values, but are not uniformly captured in the extract

INCONSISTENT DESCRIPTIONS (examples with 3 variations:)

Screenshot 2024-10-01 124609

Screenshot 2024-10-01 124651

Screenshot 2024-10-01 124804

image

CREDIT MIN/MAX (example below with 2 variations)

Screenshot 2024-10-01 134210

image

University of Southern Indiana (Note: this site indicates it's a "Modern Campus" catalog ,which owns Acalog, but our parters in Indiana indicate they're still using Acalog.)

Crawler extract: https://master.ai-course-crawler.development.c66.me/datasets/courses/43 URL: https://bulletin.usi.edu/content.php?catoid=56&catoid=56&navoid=3664 Link to output file: https://docs.google.com/spreadsheets/d/1u2rrd3T32p6yuvHnwTvk_L40qFIg-hURuI93erlODf0/edit?usp=sharing Number of courses look good Data looks good ISSUE: Small numbers of missing credit values when they are present; see example row 12) below:

Screenshot 2024-10-02 082703