ESA-APEx / apex_algorithms

Hosted APEx algorithms
Apache License 2.0

APEx benchmarks: add metrics #7

Closed: JeroenVerstraelen closed 2 weeks ago

JeroenVerstraelen commented 3 months ago

Add metrics to the initial benchmark tests.

Goal: a partitioned Parquet file (partitioned on time and on benchmarking scenario)
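
A minimal sketch of what partitioning on time and benchmark scenario could look like, assuming a Hive-style directory layout (`key=value` path segments), which partitioned Parquet datasets commonly use. The partition keys `month` and `scenario_id`, the record fields, and the `metrics` root are hypothetical and not taken from this ticket:

```python
from datetime import datetime, timezone
from pathlib import PurePosixPath


def partition_path(record: dict, root: str = "metrics") -> PurePosixPath:
    """Build a Hive-style partition directory for one benchmark metrics record."""
    # Hypothetical record fields: "timestamp" (ISO 8601 string) and "scenario_id".
    ts = datetime.fromisoformat(record["timestamp"]).astimezone(timezone.utc)
    # Time-based partition key: one directory per calendar month (an assumption).
    month = ts.strftime("%Y-%m")
    return PurePosixPath(root) / f"month={month}" / f"scenario_id={record['scenario_id']}"


example = {
    "timestamp": "2024-08-15T10:30:00+00:00",
    "scenario_id": "max_ndvi",
    "cpu_seconds": 123.0,
}
print(partition_path(example))  # metrics/month=2024-08/scenario_id=max_ndvi
```

With such a layout, a reader can restrict a query to one month or one scenario by only opening the matching directories.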

soxofaan commented 1 month ago

added some docs at https://github.com/ESA-APEx/apex_algorithms/blob/main/docs/benchmarking.md

soxofaan commented 1 month ago

most important aspect to still cover under this ticket:

soxofaan commented 1 month ago

First results from pushing some metrics as parquet to S3:

image

soxofaan commented 1 month ago

and now with unrolling the usage stats:

image
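
For illustration, "unrolling" here presumably means flattening the nested usage statistics into separate columns, so that each metric gets its own column in the Parquet table. A minimal sketch, assuming the usage data is a nested mapping with `value` and `unit` entries per metric; that shape, the function name `unroll_usage`, and the column naming are assumptions, not the project's actual schema:

```python
def unroll_usage(usage: dict, prefix: str = "usage") -> dict:
    """Flatten nested usage stats into flat, column-like key/value pairs."""
    flat = {}
    for name, entry in usage.items():
        if isinstance(entry, dict) and "value" in entry:
            # Assumed shape: {"value": <number>, "unit": <string>}
            flat[f"{prefix}_{name}"] = entry["value"]
            if "unit" in entry:
                flat[f"{prefix}_{name}_unit"] = entry["unit"]
        else:
            # Anything that is not a value/unit mapping is kept as-is.
            flat[f"{prefix}_{name}"] = entry
    return flat


# Hypothetical usage stats of a single benchmark run.
usage = {
    "cpu": {"value": 1200, "unit": "cpu-seconds"},
    "memory": {"value": 3500000, "unit": "mb-seconds"},
}
row = unroll_usage(usage)
print(row)
```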

soxofaan commented 1 month ago

merged PR #26 which covers the part about partitioned parquet files on S3

soxofaan commented 1 month ago

Ok, I think it's time to close this ticket. There were quite a few aspects to it, and I'm still unsure about some parts of the current approach, so this feels mostly like a proof-of-concept solution. Also, not all of this ticket's original requirements are met, but those were "TBD" anyway.

Some details and discussion about this PoC: