matanolabs / matano

Open source security data lake for threat hunting, detection & response, and cybersecurity analytics at petabyte scale on AWS
https://matano.dev
Apache License 2.0
1.46k stars 100 forks source link

Managed log source for S3 Inventory Reports #66

Closed timoguin closed 1 year ago

timoguin commented 1 year ago

Add support for managing S3 Inventory Reports.

Considerations

Inventory reports can be delivered in one of three formats: CSV, ORC, or Parquet. Since this isn't the only AWS service that can deliver data in Parquet (or ORC) format, we should support ingesting them, especially considering columnar format support will continue to expand to other services.

Inventory reports can be delivered hourly or weekly.

The object metadata to include in the reports is configurable, so we'll need to be sure to handle any missing keys/values.

References

Samrose-Ahmed commented 1 year ago