๐๐ฎ๐๐ฎ, ๐๐ป๐ฎ๐น๐๐๐ถ๐ฐ๐ & ๐๐. Modern alternative to Snowflake. Cost-effective and simple for massive-scale analytics. https://databend.com
no longer tries to read big TSV in parallel to make it simple and always able to get row id, but still can be deserialized in parallel after cutting into RowBatches.
simplify the handling of skip headers.
minimal reallocating of file data.
Tests
[ ] Unit Test
[x] Logic Test
[ ] Benchmark Test
[ ] No Test - Explain why
Type of change
[ ] Bug Fix (non-breaking change which fixes an issue)
[ ] New Feature (non-breaking change which adds functionality)
[ ] Breaking Change (fix or feature that could cause existing functionality not to work as expected)
I hereby agree to the terms of the CLA available at: https://docs.databend.com/dev/policies/cla/
Summary
read
big TSV in parallel to make it simple and always able to get row id, but still can be deserialized in parallel after cutting into RowBatches.Tests
Type of change
This change isโ