data-lessons / library-webscraping-DEPRECATED

Webscraping lesson for librarians NOW MOVED > https://github.com/LibraryCarpentry/lc-webscraping
https://github.com/LibraryCarpentry/lc-webscraping
Other
4 stars 12 forks source link

Include Beautiful Soup instead of Scrapy (or as an add-on) #6

Closed weaverbel closed 7 years ago

weaverbel commented 7 years ago

Should we include some of the info from here to bring into the lesson? Some people think BS is a better Python tool than Scrapy. https://github.com/qut-dmrc/odiq-web-scraping-workshop @ldko @richyvk

There is also this lesson - both developed by @brendam https://github.com/qut-dmrc/web-scraping-intro-workshop

laufers commented 7 years ago

When we have done web-scraping here at OU, we introduce BS and highlight Pandas ability to scrap tables from a webpage. Here is the outline of the notebook that was used for that lesson which was adapted from http://rjweiss.github.io/VAM-Python/VAM.html

laufers commented 7 years ago

whoops, for the OU link: http://ouinformatics.github.io/swc_beautiful_soup/

ldko commented 7 years ago

On #7 there is further discussion and some agreement about adapting the lesson to use Beautiful Soup.

runderwood commented 7 years ago

See #11 and #12. Might we close this issue and assume this will be accomplished in those?