Closed yingsu00 closed 2 days ago
Any column with whitespace in the name seems to cause this issue. Reproduced using the following steps:
In a Presto engine:
create table iceberg_data.pool.space ("two words" int) with (format='parquet'); set session iceberg_data.parquet_writer_version = 'PARQUET_1_0'; insert into iceberg_data.pool.space values (1), (2), (3);
Then from prestissimo:
select * from iceberg_data.pool.space;
N/A
should read the values out
This issue is caused by several issues:
Fixed in the Tokenizer
Any column with whitespace in the name seems to cause this issue. Reproduced using the following steps:
In a Presto engine:
Then from prestissimo:
``` #0 facebook::velox::common::Tokenizer::computeNext (this=0xfde2c9fb4640) at /workspaces/presto/presto-native-execution/velox/velox/type/Tokenizer.cpp:83 #1 0x00000000087fe224 in facebook::velox::common::Tokenizer::tryToComputeNext (this=0xfde2c9fb4640) at /workspaces/presto/presto-native-execution/velox/velox/type/Tokenizer.cpp:218 #2 0x00000000087fd5dc in facebook::velox::common::Tokenizer::hasNext (this=0xfde2c9fb4640) at /workspaces/presto/presto-native-execution/velox/velox/type/Tokenizer.cpp:39 #3 0x00000000087f70f8 in facebook::velox::common::Subfield::Subfield (this=0xfde2c9fb4720, path="two words", separators=std::shared_ptrBacktrace from GDB
Your Environment
N/A
Expected Behavior
should read the values out
This issue is caused by several issues: