Open ouwei-xhs opened 4 months ago
I've written a program to convert my data to Parquet format. During testing, I encountered the following error in FileWriter::Close:
parquet_writer.cpp:186 Failed to close writer: IOError: Couldn't serialize thrift: Internal buffer size overflow
The corresponding implementation is:
It seems to be the same case as the one mentioned in this issue.
During testing, I also found that the error is related to the column count. If the schema contains 8 columns, the write fails:
parquet_writer.cpp:172 The table constains: [8] columns and [70409] rows
parquet_writer.cpp:186 Failed to close writer: IOError: Couldn't serialize thrift: Internal buffer size overflow
But if it contains fewer than 8 columns, it works well:
parquet_writer.cpp:172 The table constains: [7] columns and [70409] rows
processor.cpp:41 Try to upload parquet file 0.parquet, file size: 7072759
Are there any parameters I can tune to solve this issue?
Component(s): C++
Which version of Arrow C++ are you using? Also, did you post the entire error message or did you truncate it?
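It may also help to check whether the failure really depends on the number of columns or on one particular column, since the report only compares 8 columns against 7. Below is a minimal sketch of a leave-one-out check. `TryWrite` and `FindCulprits` are hypothetical names, not part of Arrow, and the callback is assumed to wrap your existing writer code and return true when the writer closes without error.

```cpp
#include <cstddef>
#include <functional>
#include <string>
#include <vector>

// Hypothetical callback (not an Arrow API): writes a Parquet file that
// contains only the given columns and returns true if Close() succeeds.
using TryWrite = std::function<bool(const std::vector<std::string>&)>;

// Leave out one column at a time and collect every column whose removal
// turns a failing write into a successful one.
std::vector<std::string> FindCulprits(const std::vector<std::string>& columns,
                                      const TryWrite& try_write) {
  std::vector<std::string> culprits;
  if (try_write(columns)) return culprits;  // full schema works: nothing to find

  for (std::size_t skip = 0; skip < columns.size(); ++skip) {
    std::vector<std::string> subset;
    for (std::size_t i = 0; i < columns.size(); ++i) {
      if (i != skip) subset.push_back(columns[i]);
    }
    if (try_write(subset)) culprits.push_back(columns[skip]);
  }
  return culprits;
}
```

If exactly one column comes back, that column's data (for example, very large values) is a more likely lead than the column count itself. In that case, disabling statistics for that column through `parquet::WriterProperties::Builder::disable_statistics` might be worth trying, though whether it avoids this error is an assumption. If every column comes back, the failure tracks the overall size or count rather than any single column.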