Open mlkui opened 10 months ago
Hi, thanks for your report. Could you include data so that your example is copy-pasteable?
@phofl Sure, I have modified the previous code to make it copy-pasteable. You can download the small test data from https://github.com/mlkui/pandas_test_data. Using a larger Arrow table makes the issue more apparent.
@phofl Hi, does the problem exist?
Pandas version checks
[X] I have checked that this issue has not already been reported.
[X] I have confirmed this bug exists on the latest version of pandas.
[X] I have confirmed this bug exists on the main branch of pandas.
Reproducible Example
Issue Description
Arrow table concatenation and pandas.concat should be zero-copy, but when concatenating two zero-copy DataFrames (converted from Arrow tables), a copy happens even when pandas Copy-on-Write (CoW) is turned on.
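To make the copy visible without the test data, here is a minimal sketch. It assumes plain NumPy-backed frames rather than the Arrow-derived frames from the report, and the names `df1`, `df2`, `out`, and `shares_memory` are illustrative only. It does not toggle CoW. A NumPy-backed column needs one contiguous buffer, so a row-wise concat has to allocate a new one.

```python
import numpy as np
import pandas as pd

# Two small stand-in frames (assumption: not the data from the report).
df1 = pd.DataFrame({"a": np.arange(3, dtype="float64")})
df2 = pd.DataFrame({"a": np.arange(3, 6, dtype="float64")})

# Row-wise concatenation of the two frames.
out = pd.concat([df1, df2], ignore_index=True)

# Check whether the result still points at the first input's buffer.
shares_memory = np.shares_memory(out["a"].to_numpy(), df1["a"].to_numpy())
print("result shares memory with df1:", shares_memory)
```

The same `np.shares_memory` check can be pointed at frames produced by `to_pandas` to confirm the behaviour on the actual data.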
Also, concatenating two Arrow tables first and then converting the result to a DataFrame with zero_copy_only=True is currently not allowed either, because each column of the concatenated table has more than one chunk (chunknum>1).
Expected Behavior
When using pandas.concat to concatenate two zero-copy dataframes (converted from Arrow tables), it should not involve any copying.
Installed Versions