cagov / caldata-mdsa-caltrans-pems

CalData's MDSA project with Caltrans on Performance Measurement System (PeMS) data
https://cagov.github.io/caldata-mdsa-caltrans-pems/
MIT License

Training: Writing Python scripts in Snowflake directly for data analysis #466

Open jkarpen opened 2 weeks ago

jkarpen commented 2 weeks ago

This will be a training on doing data analysis within Snowflake using Python scripts, so that analysts do not have to download the data and run the analysis locally.

jkarpen commented 2 days ago

Notes from Mintu on the goals for what this session can look like:

Benefits:

The training would be helpful if it includes, but is not limited to, the following:

jkarpen commented 2 days ago

Note from @ian-r-rose: this training should discuss that, even when doing this analysis directly in Python, they will still want to put guardrails around the size of the data being analyzed to avoid incurring high costs, since their data is so large. We should include guidance on when it makes sense to do this type of analysis in Snowflake directly vs. another option.
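
One possible shape for such a guardrail, as a minimal sketch rather than an agreed approach: a small helper that refuses to build a query unless the requested date window is short, and that always caps the number of rows returned. The table name `pems.station_raw`, the column `sample_date`, and the limits of 7 days and 100,000 rows are all hypothetical placeholders, not values from this project.

```python
import re
from datetime import date

# Assumed limits for illustration only; real values would be agreed in the training.
MAX_DAYS = 7
MAX_ROWS = 100_000

# Allow plain identifiers with up to three dot-separated parts (db.schema.table).
_IDENTIFIER = re.compile(r"^[A-Za-z_][A-Za-z0-9_$]*(\.[A-Za-z_][A-Za-z0-9_$]*){0,2}$")


def build_guarded_query(table, start, end, max_days=MAX_DAYS, max_rows=MAX_ROWS):
    """Return a SQL string limited to a short date window and a row cap.

    Raises ValueError when the table name looks unsafe, the dates are
    reversed, or the window is longer than max_days.
    """
    if not _IDENTIFIER.match(table):
        raise ValueError(f"Unexpected table name: {table!r}")
    if end < start:
        raise ValueError("end date is before start date")

    span_days = (end - start).days + 1  # inclusive window length
    if span_days > max_days:
        raise ValueError(
            f"Requested {span_days} days; the limit is {max_days}. "
            "Narrow the window or aggregate the data first."
        )

    # `sample_date` is a hypothetical column name used to filter by day.
    return (
        f"SELECT * FROM {table} "
        f"WHERE sample_date BETWEEN '{start.isoformat()}' AND '{end.isoformat()}' "
        f"LIMIT {max_rows}"
    )


if __name__ == "__main__":
    print(build_guarded_query("pems.station_raw", date(2024, 1, 1), date(2024, 1, 3)))
```

The returned string could then be run from a Python script in Snowflake, for example through Snowpark's `session.sql(...)`. The date filter is what narrows the data the query needs to read, while `LIMIT` only caps the rows handed back to the script, so the two checks serve different purposes.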