Open Bisaloo opened 3 days ago
Would a set of HTMLs be okay for you too @avinashladdha? It definitely seems like what @Bisaloo is suggesting is way straightforward if it includes all the information we want 😊
One thing to note is that this endpoint doesn't include vignettes so we would still need an alternative option to collect vignettes.
One alternative along the same lines would be to download the dump of package source from r-universe and get all the relevant files locally.
Quite similar to the current process but with local operations rather than via GitHub API.
HTML is okay to be ingested when calculating embedings instead of .md, a couple of points to keep in mind:
,
Rather than using the GitHub API.
https://epiverse-connect.r-universe.dev/api/snapshot/zip?types=docs
Options presented on https://epiverse-connect.r-universe.dev/apis