Open WillAyd opened 4 days ago
You could try to simulate 2D indexing operations with a bitmask, but something like transposition
This gets handled by setting can_fast_transpose to False.
IIRC @phofl has expressed skepticism about taking on nanoarrow.
Feature Type
[X] Adding new functionality to pandas
[X] Changing existing functionality in pandas
[X] Removing existing functionality in pandas
Problem Description
The existing pd.arrays.BooleanArray serves a good purpose by allowing True/False alongside missing values, but the current implementation is horribly inefficient. Compared to a plain NumPy boolean array, it uses twice as much memory. Compared to PyArrow, the memory usage is 8x as much, and computational algorithms can be up to 64x slower.
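To make the memory arithmetic concrete, here is a minimal pure-Python sketch. It is not how pandas, PyArrow, or nanoarrow implement anything. It assumes the current layout spends one byte per value plus one byte per mask entry, and compares that with an Arrow-style layout that packs values and validity into one bit each (least-significant bit first, validity bit set meaning "not missing"). The helper names pack_bits, get_bit, and pack_nullable are hypothetical and exist only for this illustration.

```python
from typing import Optional, Sequence, Tuple


def pack_bits(flags: Sequence[bool]) -> bytearray:
    """Pack booleans into bytes, least-significant bit first."""
    out = bytearray((len(flags) + 7) // 8)  # round up to whole bytes
    for i, flag in enumerate(flags):
        if flag:
            out[i // 8] |= 1 << (i % 8)
    return out


def get_bit(buf: bytearray, i: int) -> bool:
    """Read bit i back out of a packed buffer."""
    return bool(buf[i // 8] & (1 << (i % 8)))


def pack_nullable(
    values: Sequence[Optional[bool]],
) -> Tuple[bytearray, bytearray]:
    """Return (data, validity) bit buffers for a nullable boolean sequence."""
    validity = pack_bits([v is not None for v in values])
    data = pack_bits([bool(v) for v in values])  # missing slots store 0
    return data, validity


values = [True, None, False, True] * 250  # 1000 elements
data, validity = pack_nullable(values)

# Assumed byte-per-flag layout: one byte for each value, one for each mask entry.
byte_per_flag = 2 * len(values)
# Bit-packed layout: one bit for each value, one for each validity flag.
bit_packed = len(data) + len(validity)

print(byte_per_flag, bit_packed, byte_per_flag // bit_packed)
```

Under these assumptions the byte-per-flag layout is 8x larger than the bit-packed one. Packing also means a single 64-bit word operation can cover 64 elements at once, which is one plausible source of the speed gap mentioned above.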
Feature Description
The pd.arrays.BooleanArray could use nanoarrow behind the scenes for its implementation, rather than the existing NumPy approach.
I think the main technical challenges for this would be:
Alternative Solutions
status quo
Additional Context
No response