airbnb / chronon

Chronon is a data platform for serving AI/ML applications.
Apache License 2.0

Add bounded unique count aggregation #781

Open jbrooks-stripe opened 2 weeks ago

Summary

Adds a BOUNDED_UNIQUE_COUNT aggregation. It computes exact unique (distinct) counts, but caps the result at a configured value so that memory usage stays bounded.
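
To illustrate the idea, here is a minimal Python sketch of a capped exact distinct count. It is not the code in this PR: the class name `BoundedUniqueCount`, the `limit` parameter, and the `update`/`merge`/`result` methods are hypothetical names chosen for the example, and the actual Chronon aggregator may store and merge its state differently.

```python
from typing import Hashable, Set


class BoundedUniqueCount:
    """Hypothetical sketch: exact distinct count, capped at `limit`."""

    def __init__(self, limit: int) -> None:
        if limit <= 0:
            raise ValueError("limit must be positive")
        self.limit = limit
        self.seen: Set[Hashable] = set()

    def update(self, value: Hashable) -> None:
        # Once the cap is reached, stop storing new values, so the set
        # never holds more than `limit` elements.
        if len(self.seen) < self.limit:
            self.seen.add(value)

    def merge(self, other: "BoundedUniqueCount") -> None:
        # Union the partial states, stopping as soon as the cap is hit.
        for value in other.seen:
            if len(self.seen) >= self.limit:
                break
            self.seen.add(value)

    def result(self) -> int:
        # Exact while below the cap; equal to `limit` once saturated.
        return len(self.seen)
```

Under these assumptions the count is exact while fewer than `limit` distinct values have been seen, and reports `limit` afterwards, so a result equal to `limit` should be read as "at least `limit`".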

Why / Goal

We have use cases where we'd prefer an exact count over the approximate equivalents, but we want safeguards in place so that memory usage cannot grow without bound.

Test Plan

Checklist

Reviewers