carpentries-incubator / lc-webscraping

Introduction to web scraping
https://carpentries-incubator.github.io/lc-webscraping/
Other
37 stars 28 forks source link

Rework of lesson available for mining into the original #30

Open Denubis opened 5 years ago

Denubis commented 5 years ago

Hi folks,

Last year, I reworked this lesson (https://github.com/resbazSQL/lc-webscraping) as a way of integrating it with the SWC capstone "excel to database." (https://github.com/resbazSQL/capstone-novice-spreadsheet-biblio) My pull request back was rightly rejected for being entirely too large. While I've had "todo: break lesson into commits" on my todo list for the last year, I suppose it's worth noting that the reworked (and taught) lesson is available for other folk (including those working on instructor checkouts) to mine text from.

Here is an incomplete listing of changes:

I hope it's useful to folks who want to find text to potentially address issues they find. It's unlikely that I'll have time in the next few months to break my edits into a series of commits for proper staging back into main.

libcce commented 5 years ago

@Denubis we still need another Maintainer on this lesson, would you be interested once you finish with your training in the next 6 months? @JoshuaDull and @timtomch are the current Maintainers. I know that @JoshuaDull won't be able to look at reviewing and updating the lesson for another 3-4 months at least. Otherwise, I'll let them respond to your changes.

Denubis commented 5 years ago

Yeah, I'd be delighted.

Denubis commented 4 years ago

Random aside, at Resbaz this year, one of the other instructors said that he was using my edit for multiple webscraping sessions and it was running well. Is anyone interested in starting discussions for a rework of this lesson, perhaps incorporating content I wrote last year, or maybe splitting the pythonic/xpath parts?