opensource-observer / oso

Measuring the impact of open source software
https://opensource.observer
Apache License 2.0

Prototyping: DuckDB-based metrics calculation #2469

Closed: ravenac95 closed this 2 days ago

ravenac95 commented 2 days ago

This is still very much a prototype; however, I'm creating a large set of issues to turn it into a functioning production system.

With this fairly large PR, I was testing how we could integrate all of this into the SQLMesh pipeline that is connected to our Iceberg data lake.

Although this isn't fully functional, I will merge it and continue working on a broader set of issues that detail the path to getting all of this fully deployed to production.

Some general details: through this work I was able to take metrics calculation from what was at least an hours-long process for each metric to one that can be run in parallel and calculated within minutes for the entire backlog. With further horizontal scaling we'd be able to operate even faster.
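To give a rough idea of why this parallelizes (a sketch only, not the code in this PR): if a metric can be computed independently for each time window, the backlog splits into tasks that don't depend on each other. Everything below is hypothetical, including the `EVENTS` rows and the `daily_active_devs` and `backfill` names, and it uses only the Python standard library rather than DuckDB or SQLMesh.

```python
from collections import defaultdict
from concurrent.futures import ThreadPoolExecutor
from datetime import date

# Hypothetical event rows: (day, project, developer). Not real OSO data.
EVENTS = [
    (date(2024, 11, 1), "proj-a", "alice"),
    (date(2024, 11, 1), "proj-a", "bob"),
    (date(2024, 11, 1), "proj-b", "alice"),
    (date(2024, 11, 2), "proj-a", "alice"),
    (date(2024, 11, 2), "proj-a", "alice"),
]


def daily_active_devs(day):
    """Count distinct developers per project for a single day."""
    devs = defaultdict(set)
    for event_day, project, dev in EVENTS:
        if event_day == day:
            devs[project].add(dev)
    return day, {project: len(names) for project, names in devs.items()}


def backfill(days, max_workers=4):
    """Compute every day's window independently, then collect the results."""
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        return dict(pool.map(daily_active_devs, days))


RESULT = backfill([date(2024, 11, 1), date(2024, 11, 2)])
```

The threads here only illustrate the task decomposition; in a real setup each window would presumably be handed to a query engine or a separate worker, which is where horizontal scaling would come in.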

Some things learned:

vercel[bot] commented 2 days ago

The latest updates on your projects:

Name | Status | Updated (UTC)
kariba-network | ✅ Ready | Nov 19, 2024 8:03am
oso-www | ✅ Ready | Nov 19, 2024 8:03am

oso-prs[bot] commented 2 days ago

Test deployment unnecessary, no dbt files have been changed.