aws / sagemaker-spark-container

The SageMaker Spark Container is a Docker image used to run data processing workloads with the Spark framework on Amazon SageMaker.
Apache License 2.0
36 stars 74 forks source link

resolve netty version by removing the feature store connector from do… #116

Closed xiaoxshe closed 1 year ago

xiaoxshe commented 1 year ago

Issue #, if available: For Spark 3.2, the community version of spark bumped up the netty-all version to 4.1.68, reference is here. And 4.1.68 of netty still has all transitive dependencies as optional, just like 4.1.51. However, for unknown reason, both EMR and sagemaker processing container are pulling 4.1.74 which breaks the original dependency tree (transitive dependencies are no longer optional). EMR is using a enterprise optimized version spark.

For Spark 3.3, spark bumped up the dependency version of netty-all to 4.1.74 which is the version starts causing the problem. Reference is here.

Description of changes: Remove feature store connector dependencies, feature store team will publish the fix to re-enable the library

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.