Open Wainberg opened 8 months ago
>>> pl.Series(['foo', 'bar']).to_arrow() # ok <pyarrow.lib.LargeStringArray object at 0x7f6bf91ec2e0> [ "foo", "bar" ] >>> pl.Series([b'foo', b'bar'], dtype=pl.Object).to_arrow() # ok <pyarrow.lib.FixedSizeBinaryArray object at 0x7f6bf91ec2e0> [ 805086F96B7F0000, 0058C4EF6B7F0000 ] >>> pl.Series(['foo', 'bar'], dtype=pl.Object).to_arrow() # ?? <pyarrow.lib.FixedSizeBinaryArray object at 0x7f6bf91ec7c0> [ 106086F96B7F0000, 104B86F96B7F0000 ]
No response
pl.Series(['foo', 'bar'], dtype=pl.Object).to_arrow() returns a FixedSizeBinaryArray.
pl.Series(['foo', 'bar'], dtype=pl.Object).to_arrow()
FixedSizeBinaryArray
It should convert to LargeStringArray, not FixedSizeBinaryArray.
LargeStringArray
They are objects. They are opaque and should not be successful in converting to Arrow. Arrow doesn't support objects. We should raise an error here.
Checks
Reproducible example
Log output
No response
Issue description
pl.Series(['foo', 'bar'], dtype=pl.Object).to_arrow()
returns aFixedSizeBinaryArray
.Expected behavior
It should convert to
LargeStringArray
, notFixedSizeBinaryArray
.Installed versions