rapidsai / deployment

RAPIDS Deployment Documentation
https://docs.rapids.ai/deployment/stable/
9 stars 28 forks source link

Add a workflow example that uses multi-node Databricks and Dask (ideally also using dask-deltatable) #298

Closed jacobtomlinson closed 7 months ago

jacobtomlinson commented 9 months ago
### Prep tasks
- [x] Choose a workload (xgboost training, cuml training)
- [x] Put a dataset into Delta Lake
- [x] Figure out how to read with dask-deltatable into a dask-cudf dataframe

Workflow structure

Option 1 (goal):

Option 2 (stretch goal):

Related links: