Open JessicaS11 opened 6 months ago
We shouldn't have to think about formats so this tutorial is hopefully be obsolete in the next 5 years. But we have a long ways to go so we want to share with you what cloud optimized means and why you should care so you can help us get there.
3 learning outcomes
use sliderule to output in parquet demonstrate icesat-2 in geoparquet with lonboard
I'll help out to make sure this notebook from last year can work with lonboard on CryoCloud JupyterHub https://icesat-2-2023.hackweek.io/tutorials/sliderule/parquet-s3.html
As a point of comparison, my colleague Sean went through the process of creating parquet without sliderule and it is much more complicated: https://github.com/developmentseed/icesat-parquet/blob/main/atl08_earthaccess.ipynb. May be worth making that point so participants are motivated to use sliderule.
Lead: Aimee Barciauskas Date: 19/08/2024 Start Time: 1300 Duration: 45 Description:
Details
### Learning Outcomes * outcome 1 * outcome 2 * outcome 3 ### People Developing the Tutorial (content creation, helpers, teachers) ### Summary Description * Why we should care about cloud-optimized formats (now)? * What does it mean to be cloud-optimized? * Cloud formats and cloud computing * Demo of ICESat-2 in Parquet format using lonboard ### Dependencies (things people should know in advance of the tutorial) ### Technical Needs (GPUs? Large file storage? Unique libraries?)