AbsaOSS / spline-spark-agent

Spline agent for Apache Spark
https://absaoss.github.io/spline/
Apache License 2.0

sqs dispatcher #718

Open ronybony1990 opened 1 year ago

ronybony1990 commented 1 year ago

Send the executionPlan and event to an SQS queue.

cerveada commented 1 year ago

The tests are failing because the license header is missing in the new files.

wajda commented 1 year ago

@ronybony1990, thank you for the PR; it looks like a really good addition to the Spline agent. However, there is one thing I'm unsure about: the dependency on the AWS SDK. It almost doubles the size of the agent bundle and pulls in several other transitive dependencies, widening the area of potential collisions with the dependencies of other agent or Spark extensions.

Since at the end of the day the SQS request is made through an AWS API call anyway, I'm thinking that calling the API directly, without the SDK library, might be a better approach. Error handling and retry logic don't seem too complicated for this use case. Plus, there are FallbackLineageDispatcher and CompositeLineageDispatcher, which can be used to improve client fault tolerance.

Another option would be to make the SDK dependency optional, so that it is not automatically included in the bundle. Users would have to provide all the necessary AWS dependencies themselves to be able to use the SQS dispatcher.

Yet another option would be to create an additional spline-agent-extra-aws module or the like that would contain AWS-specific plugins and extensions to the agent, to be used as a complement to the agent-bundle.
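To illustrate the fault-tolerance point, here is a minimal sketch of the fallback and composite ideas in plain Scala. It is not the agent's actual code: the `Dispatcher` trait, the class names, the `failOnErrors` flag, and the test doubles are all hypothetical, simplified stand-ins, and the real FallbackLineageDispatcher and CompositeLineageDispatcher may behave differently in detail.

```scala
import scala.collection.mutable.ListBuffer
import scala.util.control.NonFatal

object DispatcherSketch {

  // Hypothetical, simplified stand-in for a lineage dispatcher contract.
  trait Dispatcher {
    def name: String
    def send(payload: String): Unit
  }

  // Tries the primary dispatcher first; on any non-fatal error,
  // hands the same payload to the fallback dispatcher instead.
  final class FallbackDispatcher(primary: Dispatcher, fallback: Dispatcher) extends Dispatcher {
    val name: String = s"fallback(${primary.name} -> ${fallback.name})"

    def send(payload: String): Unit =
      try primary.send(payload)
      catch { case NonFatal(_) => fallback.send(payload) }
  }

  // Sends the payload to every delegate in order. When failOnErrors is false,
  // a failing delegate is skipped; when true, the first failure propagates.
  final class CompositeDispatcher(delegates: Seq[Dispatcher], failOnErrors: Boolean) extends Dispatcher {
    val name: String = delegates.map(_.name).mkString("composite(", ", ", ")")

    def send(payload: String): Unit =
      delegates.foreach { d =>
        try d.send(payload)
        catch { case NonFatal(_) if !failOnErrors => () }
      }
  }

  // Test double that records everything it receives.
  final class RecordingDispatcher(val name: String) extends Dispatcher {
    val received: ListBuffer[String] = ListBuffer.empty[String]

    def send(payload: String): Unit = {
      received += payload
      ()
    }
  }

  // Test double that simulates an unreachable queue.
  final class FailingDispatcher(val name: String) extends Dispatcher {
    def send(payload: String): Unit =
      throw new RuntimeException(s"$name unavailable")
  }
}
```

With wrappers like these, a direct SQS call would only need to throw on failure; retrying or rerouting can be layered on by wrapping it, for example `new FallbackDispatcher(sqs, console)`, instead of being built into the SQS dispatcher itself.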

@cerveada what do you think?

cerveada commented 1 year ago

I mostly agree.

sonarcloud[bot] commented 1 year ago

Kudos, SonarCloud Quality Gate passed!

0 Bugs (rating A)
0 Vulnerabilities (rating A)
0 Security Hotspots (rating A)
0 Code Smells (rating A)

No Coverage information
No Duplication information