Investigate using duckdb for rolling window queries

opensource-observer / oso

Measuring the impact of open source software

Apache License 2.0

73 stars 16 forks source link

What is it?

Mostly retroactive as this was a random hunch that seems to be working. Basically the thought was that for any rolling window query we could actually have duckdb load the dependent tables into memory and then run the queries. We could even get fairly smart with this if we are able to target specific partitions as well if things got large enough. The reason I wanted to try this is because it seemed that because of the way SQLMesh was scheduling runs the cache for trino never really got fully warm as it was running things so things ended up being slow.

opensource-observer / oso

Investigate using duckdb for rolling window queries #2379

What is it?