ICESAT-2HackWeek / h5cloud

BSD 3-Clause "New" or "Revised" License

Re-plot performance testing based on individual files #25

Open asteiker opened 10 months ago

asteiker commented 10 months ago

The plot shown in #19 collates testing runs across i/o libraries for each data format (original, repack, kerchunk-original, kerchunk-repack). A scatter plot of the performance tests for individual file runs, based on #20, would be more valuable because it would show the within-group variability more clearly. Grouping by tool may be the most useful.

### Tasks
- [x] Update h5cloud/helpers/s3filelinks.json to point to persistent data in the new bucket
- [x] Re-run /h5cloud/notebooks/run-tests.ipynb with a new results directory
- [x] Persist the pandas DataFrame in the benchmark notebook by writing it to CSV
- [x] Replot as scatter plot grouped by tool
- [x] Plot all file and i/o param combinations on a single plot
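
For reference, the CSV and scatter plot tasks above could look roughly like the sketch below. It is only an illustration: the column names (`tool`, `format`, `file`, `time_s`), the tool and file names, the timing values, and the `results-sketch` output directory are all made-up placeholders, not the actual schema or results from the benchmark notebook.

```python
from pathlib import Path

import matplotlib
matplotlib.use("Agg")  # headless backend so the sketch runs without a display
import matplotlib.pyplot as plt
import pandas as pd

# Placeholder per-file results; names and timings are synthetic, not h5cloud output.
results = pd.DataFrame({
    "tool": ["tool_a", "tool_a", "tool_b", "tool_b"],
    "format": ["original", "repack", "original", "repack"],
    "file": ["file_1.h5", "file_1.h5", "file_1.h5", "file_1.h5"],
    "time_s": [12.0, 4.5, 15.2, 6.1],
})

# Persist the dataframe so the plot can be rebuilt without re-running the tests.
out_dir = Path("results-sketch")  # hypothetical results directory
out_dir.mkdir(exist_ok=True)
csv_path = out_dir / "results.csv"
results.to_csv(csv_path, index=False)
reloaded = pd.read_csv(csv_path)

# One x position per tool, one point per individual file run, colored by format.
tools = sorted(reloaded["tool"].unique())
x_of = {tool: i for i, tool in enumerate(tools)}

fig, ax = plt.subplots()
for fmt, runs in reloaded.groupby("format"):
    ax.scatter(runs["tool"].map(x_of), runs["time_s"], label=fmt)
ax.set_xticks(range(len(tools)))
ax.set_xticklabels(tools)
ax.set_xlabel("tool")
ax.set_ylabel("time (s)")
ax.legend(title="format")
fig.savefig(out_dir / "scatter_by_tool.png")
```

Plotting each run as its own point, instead of one collated value per group, is what makes the spread within each tool visible.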

asteiker commented 9 months ago

Dependent on #28