DependencyTrack / hyades

Incubating project for decoupling responsibilities from Dependency-Track's monolithic API server into separate, scalable services.
https://dependencytrack.github.io/hyades/latest
Apache License 2.0
59 stars 18 forks source link

Global Kafka and Kafka streams configuration review #70

Open syalioune opened 1 year ago

syalioune commented 1 year ago

The purpose of this issue is to list checkpoints related to Kafka usage that should be validated before releasing the POC (both on the external application and DT side).

Requirements

Topics

Producers

Consumers

Kafka Streams

nscuro commented 1 year ago

Thanks again for yet another detailed and super helpful writeup (and shame on me for the late response ⚰️)!

The processing semantics are definitely something we should consider more sooner than later (although I assume "at least once" will be fine in most cases).

Reading through this list, there are many topics that will depend on the user's expected workload. Meaning we will need to experiment how certain setups behave, and provide proper documentation and example configs. Referring to things like partition count, replication factor etc. here.

syalioune commented 1 year ago

The processing semantics are definitely something we should consider more sooner than later (although I assume "at least once" will be fine in most cases).

As long as the processing logic is idempotent, at least once is sufficient. It should be ok for the analyzers.

Reading through this list, there are many topics that will depend on the user's expected workload. Meaning we will need to experiment how certain setups behave, and provide proper documentation and example configs. Referring to things like partition count, replication factor etc. here.

Yes, that was my goal. List all of the concerns, identify those that should be enforce by DT by default, those related to user's workload and provide sound & safe config/recommendations.