tidyverse / rvest

Simple web scraping for R
https://rvest.tidyverse.org

Fix read_html_live example #426

Open jrosell opened 4 weeks ago

jrosell commented 4 weeks ago

Closes https://github.com/tidyverse/rvest/issues/430

I checked the read_html_live() example and found that the CSS selectors had changed and a cookie consent banner had been added.

This PR updates the read_html_live() example so that it rejects cookies and extracts the organizations from the new version of the page. A reload is needed to force the JSON file to download with the cookie consent settings already configured.
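
For readers who have not seen the diff, here is a rough sketch of the flow described above: click the reject button, reload, then query the live page. It is not the code from this PR. The helper name `scrape_organizations()`, the URL, and both CSS selectors in the usage comment are placeholders, and the fixed `Sys.sleep()` wait is an assumption that may need tuning.

```r
library(rvest)

# Hypothetical helper: the name and all three arguments are illustrative only.
scrape_organizations <- function(url, reject_css, org_css, wait = 2) {
  # Open the page in a headless Chrome session (requires the chromote package).
  sess <- read_html_live(url)

  # Dismiss the cookie consent banner by clicking its "reject" button.
  sess$click(reject_css)

  # Reload so the page fetches its data with the consent choice already stored.
  # `session` is the underlying chromote session; Page$reload() is the
  # Chrome DevTools Protocol command that chromote exposes.
  sess$session$Page$reload()
  Sys.sleep(wait) # crude pause while the page rebuilds its content

  # Query the live DOM and return the text of the matching elements.
  sess %>%
    html_elements(org_css) %>%
    html_text2()
}

# Usage with placeholder values (needs Chrome and network access):
# scrape_organizations(
#   "https://example.com/rankings",  # placeholder URL
#   "button.reject-cookies",         # placeholder selector
#   ".organization-name"             # placeholder selector
# )
```

The reload goes through the underlying chromote session, on the assumption that the live session object has no reload method of its own.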

jrosell commented 4 weeks ago

I see some tests fail because of snapshot changes:

-    Size:   821273
+     Size:   821905
-   Navigating to <http://rstudio.com/>.
+   Navigating to <https://posit.co/>.

Let me know if I need to change anything.