The Web Data Science course outline specifically mentioned PDFs, and somehow I thought I'd avoid them... but I really should cover them. Within the rvest chapter? A separate chapter?
Probably try some scraping and see how much tooling there is to talk about. It might only be a ~section if there isn't much to say (other than "Good luck!")
The Web Data Science course outline specifically mentioned PDFs, and somehow I thought I'd avoid them... but I really should cover them. Within the rvest chapter? A separate chapter?
Probably try some scraping and see how much tooling there is to talk about. It might only be a ~section if there isn't much to say (other than "Good luck!")
https://www.copyright.gov/fair-use/fair-index.html has links to PDFs to try in this spike.