crate / cratedb-toolkit

CrateDB Toolkit, an SDK for CrateDB and CrateDB Cloud.
https://cratedb-toolkit.readthedocs.io/
GNU Affero General Public License v3.0
7 stars 3 forks source link

Share and use datasets via Python code #89

Open amotl opened 9 months ago

amotl commented 9 months ago

About

Easily consume datasets from tutorials and/or production applications like others are doing it, using Python code.

References

Standards

amotl commented 6 months ago

About

Those patches add a corresponding miniature subsystem, and bring it into application. With them, cratedb-toolkit will provide convenient access to cratedb-datasets.

Synopsis

from cratedb_toolkit.datasets import load_dataset

dataset = load_dataset("tutorial/weather-basic")
dataset.dbtable(dburi="crate://crate@localhost/", table="weather_data").load()
amotl commented 5 months ago

About

Provide access to datasets at Kaggle, to be easily consumed by tutorials and/or production applications.

Synopsis

from cratedb_toolkit.datasets import load_dataset

dataset = load_dataset("kaggle://guillemservera/global-daily-climate-data/daily_weather.parquet")

# Only download once, nothing else.
dataset.acquire()

# Create table schema in database.
dataset.dbtable(dburi="crate://crate@localhost/", table="kaggle_daily_weather").create()