When Hive tries to read data from DynamoDB backed Hive table using DynamoDBStorageHandler, read throughput is getting set as null despite ReadCapacityUnits being set in ProvisionedThroughput configured in table properties. This is leading to incorrect mappers calculation during split generation. I found this issue in 4.16.0 version and this issue doesn't exist in 4.9.0 version of dynamodb connector.
When Hive tries to read data from DynamoDB backed Hive table using DynamoDBStorageHandler, read throughput is getting set as null despite ReadCapacityUnits being set in ProvisionedThroughput configured in table properties. This is leading to incorrect mappers calculation during split generation. I found this issue in 4.16.0 version and this issue doesn't exist in 4.9.0 version of dynamodb connector.
The following are the relevant Hive logs:
Split generation log:
I suspect this commit added in 4.11.0 version. But, don't have much context on the change done. So, need some help in fixing this issue.