awslabs / athena-glue-service-logs

Glue scripts for converting AWS Service Logs for use in Athena
Apache License 2.0

The converter should only read all data once #31

Open NiklasMolin opened 1 year ago

NiklasMolin commented 1 year ago

Issue #, if available: N/A

Description of changes: As far as I can see, the current implementation reads all data twice. The DynamicFrame dropNull causes recomputeSchema to be triggered in the toDF call, which forces a second pass over the input. There are probably a thousand ways to avoid this. I just added something that solves the problem quickly, because I don't know if this project is still alive.
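To illustrate the cost being described, here is a minimal sketch in plain Python. It does not use the Glue or Spark APIs, and every name in it (`CountingSource`, `infer_schema`, `convert_two_pass`, `convert_one_pass`) is hypothetical. It only models the general pattern: if the schema has to be recomputed from the data before conversion, the source is scanned twice, whereas a schema known up front needs a single scan. Whether the change in this PR works this way is an assumption, not something the description confirms.

```python
from typing import Dict, Iterable, Iterator, List, Optional

Record = Dict[str, Optional[str]]


class CountingSource:
    """Hypothetical stand-in for the raw log files; counts full scans."""

    def __init__(self, rows: List[Record]) -> None:
        self._rows = rows
        self.scans = 0

    def scan(self) -> Iterator[Record]:
        # Every call models one complete read of the input data.
        self.scans += 1
        return iter(list(self._rows))


def infer_schema(rows: Iterable[Record]) -> List[str]:
    """Derive the column list by inspecting every record (one full pass)."""
    columns: List[str] = []
    for row in rows:
        for key, value in row.items():
            # Columns that are null everywhere are dropped from the schema.
            if value is not None and key not in columns:
                columns.append(key)
    return columns


def convert_two_pass(source: CountingSource) -> List[Record]:
    # Pass 1: the schema is recomputed from the data itself.
    schema = infer_schema(source.scan())
    # Pass 2: the actual conversion reads the data again.
    return [{c: row.get(c) for c in schema} for row in source.scan()]


def convert_one_pass(source: CountingSource, schema: List[str]) -> List[Record]:
    # The schema is supplied by the caller, so only the conversion pass runs.
    return [{c: row.get(c) for c in schema} for row in source.scan()]
```

In this model, `convert_two_pass` leaves `source.scans` at 2 while `convert_one_pass` leaves it at 1, and both produce the same records when the supplied schema matches the inferred one. Caching the data after the first read would be another way to avoid the second scan, at the cost of memory.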

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.