opensearch-project / data-prepper

Data Prepper is a component of the OpenSearch project that accepts, filters, transforms, enriches, and routes data at scale.
https://opensearch.org/docs/latest/clients/data-prepper/index/
Apache License 2.0

Audit Logs #2705

Open cmanning09 opened 1 year ago

cmanning09 commented 1 year ago

Is your feature request related to a problem? Please describe. This applies to pull-based sources that perform bulk reads, such as S3 scan or the OpenSearch source that is currently in PR. As a user, I would like a mechanism to track which data has been read and processed. This should cover cases where data is dropped or a node in my Data Prepper cluster becomes unresponsive.

Describe the solution you'd like An audit log comes to mind. This log would contain a list of data processing events related to docs, indices, or some other metadata determined by the source. These logs could be used to determine the exact time frame in which a set of data was pulled into Data Prepper.
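
To make the idea more concrete, here is a rough sketch of what one audit record and a time-frame lookup could look like. It is illustrative only: the `AuditEvent` fields, the action names, the sample values, and both helper functions are hypothetical and not part of Data Prepper.

```python
import json
from dataclasses import dataclass, asdict


@dataclass(frozen=True)
class AuditEvent:
    """One hypothetical audit record; every field name here is an assumption."""
    timestamp: str  # ISO 8601 in UTC, so string order matches time order
    source: str     # e.g. "s3-scan" or "opensearch"
    item: str       # source-defined metadata: object key, index name, doc id
    action: str     # assumed vocabulary: "read", "processed", "dropped"
    node: str       # Data Prepper node that handled the item


def to_log_line(event):
    """Serialize an event as one JSON line for an append-only log."""
    return json.dumps(asdict(event), sort_keys=True)


def time_frame(events, item_prefix):
    """Return (first, last) timestamps for items starting with item_prefix."""
    stamps = sorted(e.timestamp for e in events if e.item.startswith(item_prefix))
    return (stamps[0], stamps[-1]) if stamps else None


# Made-up sample data for illustration.
events = [
    AuditEvent("2023-05-16T10:00:00+00:00", "s3-scan", "logs/a.json", "read", "node-1"),
    AuditEvent("2023-05-16T10:00:05+00:00", "s3-scan", "logs/a.json", "processed", "node-1"),
    AuditEvent("2023-05-16T10:02:00+00:00", "s3-scan", "logs/b.json", "dropped", "node-2"),
]

for event in events:
    print(to_log_line(event))
print(time_frame(events, "logs/"))
```

With records like these, answering "when was this set of data pulled in?" becomes a filter over the log, and a "dropped" entry or a missing "processed" entry points at data that needs attention.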

Describe alternatives you've considered (Optional)

cmanning09 commented 1 year ago

This idea is very vague and leaves a lot of ambiguity about the alternatives. We need to tighten the requirements and figure out exactly what we want to support.