jonthegeek / wapir

Web APIs with R
https://wapir.io/
Scraping Examples #19

Open jonthegeek opened 10 months ago

jonthegeek commented 10 months ago

Add an appendix with examples for the scraping chapter, or maybe multiple appendices. That way you have control over the content and format, at least mostly. Or consider putting the examples outside the book as raw HTML. In this repo or elsewhere? Deployed on GitHub Pages? Can the Quarto site include static, unrelated HTML?

jonthegeek commented 10 months ago
  1. A static table that's the only thing on the page. Content = ? Starting to work in some API terminology would be sneaky. Maybe the main sections of an OpenAPI 3.1 doc?
  2. Multiple tables and/or weird formatting that makes html_table() a little painful. SelectorGadget needs to be easier than html_table(), or at least clearer. A page with HTTP Request Methods in one table and HTTP Status Codes in a second might suffice. Probably need to format something weirdly enough to trip up html_table().
  3. Structured content not in a table, like the {rvest} Star Wars data. A collection of XPath rules would be fantastic, if I can put something together (or reshare something with an appropriate license).
  4. Structured content in a particular cell of one of two tables on a page, to show piping / using XPath directly. Include images so that one of the things we grab can be the src attributes of those images. Images are a great example for this, actually.
  5. Can I ~easily deploy something that requires a session? If so, do that. Might require something like Netlify, but I also MIGHT be able to do simple (non-secure) session stuff purely via HTML/GitHub Pages.
  6. Probably a page with HTTP request methods and status codes, to slyly
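
A rough sketch of what ideas 2 and 4 might look like from the reader's side, assuming a made-up page with two tables where the second one has images in its cells. The HTML, the table ids, and the image paths are all hypothetical placeholders, not content that exists anywhere yet; only the {rvest} calls are real.

```r
library(rvest)

# Hypothetical example page (invented for illustration): two tables,
# with images inside cells of the second table.
page <- minimal_html('
  <table id="methods">
    <tr><th>Method</th><th>Safe</th></tr>
    <tr><td>GET</td><td>yes</td></tr>
    <tr><td>POST</td><td>no</td></tr>
  </table>
  <table id="status">
    <tr><th>Code</th><th>Icon</th></tr>
    <tr><td>200</td><td><img src="img/ok.png" alt="OK"></td></tr>
    <tr><td>404</td><td><img src="img/not-found.png" alt="Not Found"></td></tr>
  </table>
')

# html_table() on a set of <table> nodes returns one tibble per table,
# so the reader has to pick the one they want.
# (The native pipe |> needs R >= 4.1.)
tables <- page |>
  html_elements("table") |>
  html_table()

# Use XPath directly to reach only the images in the second table,
# then pull out their src attributes.
icon_srcs <- page |>
  html_elements(xpath = "(//table)[2]//td/img") |>
  html_attr("src")
```

The point of the second pipeline is that html_table() drops the images entirely, so grabbing their src attributes forces the reader to select nodes themselves.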