Deduplication of events according to a given set of keys is a common way to reduce the volume downstream. A dedupe operator can achieve this function.
### Definition of Done
- [x] Look at the equivalent functionality at [Splunk](https://docs.splunk.com/Documentation/SplunkCloud/latest/SearchReference/dedup), [Cribl](https://docs.cribl.io/search/dedup/) ([blog](https://cribl.io/blog/streaming-data-deduplication-with-cribl/)), and others
- [x] Design the deduplication mechanism (e.g., compound record hash, hash table expiry, etc.)
- [x] Agree on the operator UX
- [x] Implement and test the operator
Deduplication of events according to a given set of keys is a common way to reduce the volume downstream. A
dedupe
operator can achieve this function.