datonic / datadex

📦 Serverless and local-first Open Data Platform
http://datadex.datonic.io
MIT License
220 stars 14 forks source link

Make it easy to play with the data #64

Open davidgasquez opened 1 month ago

davidgasquez commented 1 month ago

Datadex is producing several datasets. Right now, they're all in HuggingFace as static files.

Should be easy for anyone to click a button from the Portal and have an environment with all the datasets ready to be used. Similar to https://github.com/datonic/datadex/issues/32#issuecomment-1801693041.

The ideal solution is:

  1. Fully static (e.g: DuckDB WASM + Pyodide + Parquet files)
  2. Can be URL encoded so folks can share what they work on

Reallistically, we can start with a Colab Notebook that grabs all the datasets from HuggingFace.