GoswamiSH closed this issue 2 years ago
Good idea. I imagine this would just mean creating a parallelizeDynamoDb() method in SparkFactory. To that end, we might be able to take advantage of audienceproject/spark-dynamodb.
@GoswamiSH is there any other feature you'd need before you would consider this complete?
@mmlinford Based on the discussion with @aosama and @matthewgillett about the approach and extent of the enhancements, I don't think we need any other feature as a prerequisite to complete this issue.
Will look into audienceproject/spark-dynamodb and get back to you. Let's discuss this with @matthewgillett and @aosama as well.
@GoswamiSH Please see pull request #55 for DynamoDB support (as well as JSON format files). I used the https://github.com/awslabs/emr-dynamodb-connector dependency to read data from DynamoDB into Spark.
This issue is to add data validation functionality for DynamoDB.