Reworks the `_parquet_output_format_converter` function to speed it up by vectorizing it and using NumPy arrays. Previously it took around 50 seconds, as shown in the picture below.
After vectorizing, it takes about 1 second.
The output was also checked and is the same as before.
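For reviewers unfamiliar with the approach, here is a minimal sketch of the general pattern: replace a per-element Python loop with whole-array NumPy operations, then confirm both versions agree. The functions `convert_loop` and `convert_vectorized`, and the arithmetic inside them, are hypothetical stand-ins; they are not the body of `_parquet_output_format_converter`.

```python
import time

import numpy as np


def convert_loop(values, threshold):
    """Hypothetical per-element version: one Python iteration per row."""
    scaled = []
    flags = []
    for v in values:
        scaled.append(v * 2.0 + 1.0)
        flags.append(v > threshold)
    return np.array(scaled), np.array(flags)


def convert_vectorized(values, threshold):
    """Hypothetical vectorized version: the same math on the whole array at once."""
    values = np.asarray(values, dtype=np.float64)
    return values * 2.0 + 1.0, values > threshold


# Compare both versions on the same synthetic input (assumed data, not project data).
rng = np.random.default_rng(0)
sample = rng.random(200_000)

start = time.perf_counter()
loop_scaled, loop_flags = convert_loop(sample, 0.5)
loop_seconds = time.perf_counter() - start

start = time.perf_counter()
vec_scaled, vec_flags = convert_vectorized(sample, 0.5)
vec_seconds = time.perf_counter() - start

# The rework is only valid if both versions return identical results.
assert np.array_equal(loop_scaled, vec_scaled)
assert np.array_equal(loop_flags, vec_flags)

print(f"loop: {loop_seconds:.4f}s, vectorized: {vec_seconds:.4f}s")
```

The timings printed by this sketch depend on the machine and say nothing about the 50 second and 1 second figures above, which come from the real function.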
Original function output:
Updated function output:
Additions
-
Removals
-
Changes
-
Testing
1.
Screenshots
Notes
-
Todos
-
Checklist
[ ] PR has an informative and human-readable title
[ ] Changes are limited to a single goal (no scope creep)
[ ] Code can be automatically merged (no conflicts)
[ ] Code follows project standards (link if applicable)
[ ] Passes all existing automated tests
[ ] Any change in functionality is tested
[ ] New functions are documented (with a description, list of inputs, and expected output)
[ ] Placeholder code is flagged / future todos are captured in comments
[ ] Visually tested in supported browsers and devices (see checklist below :point_down:)
[ ] Project documentation has been updated (including the "Unreleased" section of the CHANGELOG)
[ ] Reviewers requested with the Reviewers tool :arrow_right:
Testing checklist
Target Environment support
Accessibility
Other