cansavvy closed this issue 2 years ago
This is probably a better scraper: https://github.com/alirezamika/autoscraper
Here's the idea:
Check it out! (NA means there's not a bookdown associated with the course as far as the GitHub API is concerned) https://docs.google.com/spreadsheets/d/1klDpaQcGjYUa5Xro-DxqNTJq7Did-Ujn7IGgm7W6kJ8/edit#gid=65359487
This accomplishes Steps 1 - 6 so far.
The rest of the issues for this will be tracked on https://github.com/jhudsl/gitHelpeR
Describe the scope of your content idea
To cut down on manual labor, I'm going to try to scrape as much course info as I can from the jhudsl and DataTrail organizations so we can add the courses to the library Google Sheet to start off: https://docs.google.com/spreadsheets/d/13TvG95v71a0QsCcaZC7zB4GbtF67Q6Bb-Dc_GLOScHY/edit#gid=0
Ideas on how to get there
This GitHub scraper is something to look into: https://github.com/sbaack/github-scraper
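For reference, here is a minimal Python sketch of the scraping idea, assuming the GitHub REST API's `GET /orgs/{org}/repos` endpoint returns enough to work with. It is not what either scraper linked here or gitHelpeR does. The function names (`fetch_json`, `list_org_repos`, `course_rows`) are made up for illustration, and treating the repo's `homepage` field as the course's rendered site is only a guess. Missing values are written as `NA`, matching the spreadsheet.

```python
import json
from urllib.request import Request, urlopen

API = "https://api.github.com"


def fetch_json(url):
    """Fetch one URL and decode the JSON body (unauthenticated, so rate limited)."""
    req = Request(url, headers={"Accept": "application/vnd.github+json"})
    with urlopen(req) as resp:
        return json.load(resp)


def list_org_repos(org, fetch=fetch_json, per_page=100):
    """Collect every repo for an organization, following page numbers until empty."""
    repos, page = [], 1
    while True:
        batch = fetch(f"{API}/orgs/{org}/repos?per_page={per_page}&page={page}")
        if not batch:
            break
        repos.extend(batch)
        page += 1
    return repos


def course_rows(repos):
    """Reduce raw repo records to the columns we might want in the sheet."""
    return [
        {
            "name": repo["name"],
            "url": repo["html_url"],
            # Assumption: an empty description/homepage is recorded as "NA".
            "description": repo.get("description") or "NA",
            "homepage": repo.get("homepage") or "NA",
        }
        for repo in repos
    ]


# Example usage (makes real network calls):
#   rows = course_rows(list_org_repos("jhudsl"))
```

The `fetch` argument is injectable so the paging logic can be tried out without hitting the API. For real use, an access token would probably be needed to avoid the unauthenticated rate limit.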