A CLI that scrapes UTD data, parses it, and uploads it to the Nebula API database for community use.
Part of Project Nebula.
The grade-data directory contains .csv files of UTD grade data. Each file is named with a two-digit year followed by S, U, or F, denoting the Spring, Summer, and Fall semesters, respectively. For example, 22F.csv corresponds to the 2022 Fall semester, whereas 18U.csv corresponds to the 2018 Summer semester.
The scrapers directory contains the scrapers for various UTD data sources. This is where the data pipeline begins.
The parser directory contains the files and methods that parse the scraped data. This is the 'middle man' of the data pipeline.
The uploader directory contains the uploader that sends the parsed data to the Nebula API MongoDB database. This is the final stage of the data pipeline.
Documentation for the project will be created soon, but for more information please visit our Discord.
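The grade-data file naming convention described above can be illustrated with a minimal Python sketch. It is not taken from this repository: the helper name parse_grade_filename, the Term type, and the assumption that every two-digit year falls in the 2000s are all hypothetical and exist only for illustration.

```python
import re
from typing import NamedTuple

# Suffix letters as described above: S = Spring, U = Summer, F = Fall.
SEMESTERS = {"S": "Spring", "U": "Summer", "F": "Fall"}

# Two-digit year followed by one semester letter, e.g. "22F.csv".
FILENAME_PATTERN = re.compile(r"(\d{2})([SUF])\.csv")


class Term(NamedTuple):
    year: int
    semester: str


def parse_grade_filename(filename: str) -> Term:
    """Hypothetical helper: map a grade-data file name to its year and semester."""
    match = FILENAME_PATTERN.fullmatch(filename)
    if match is None:
        raise ValueError(f"unrecognized grade-data file name: {filename!r}")
    # Assumption: two-digit years all belong to the 2000s.
    return Term(2000 + int(match.group(1)), SEMESTERS[match.group(2)])


print(parse_grade_filename("22F.csv"))  # Term(year=2022, semester='Fall')
print(parse_grade_filename("18U.csv"))  # Term(year=2018, semester='Summer')
```

Any file name that does not match the pattern raises a ValueError instead of being guessed at, which keeps unexpected files in the directory from being silently mislabeled.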
To build the project, clone the repository and then either:
Run make in the root (top-level) directory, on systems with make installed (e.g. most Linux distros and macOS).
Run build.bat on Windows systems (unless you want to deal with getting make to work on Windows :P).
The build process will output an executable file named api-tools; this executable is the CLI and can be run in your terminal!
[TBD]
If you have any questions about this project, reach out to the Project Nebula maintainers at core-maintainers@utdnebula.com or open an issue or discussion on this repository.