sandboxnu / major-scraper

Scraping Northeastern's Academic Catalog for use in GraduateNU.
GNU General Public License v3.0

Determine a Good Scraper Data Format #3

Closed AlpacaFur closed 1 year ago

AlpacaFur commented 1 year ago

Summary

Before we start implementing anything, we want to make sure that the data format we settle on is a good one. That way, we can implement the Scraper output code and the Tooling ingest code in parallel without blocking each other.

Ideally, this structure will be easy for us (as humans) to navigate and easy for our Tooling (and Graduate's backend) to read.

Tasks

clue4 commented 1 year ago

https://www.notion.so/sandboxnu/The-Ultimate-Scraper-Docs-c9d9bba6e0cd4c46aa7742eaea2c1e67

(still a wip, but our thoughts so far)

clue4 commented 1 year ago

edit: decisions made in doc

clue4 commented 1 year ago

closing as resolved!