GH-44651: [Python] Allow from_buffers to work with StringView on Python

apache / arrow

Apache Arrow is the universal columnar format and multi-language toolbox for fast data interchange and in-memory analytics

Apache License 2.0

14.65k stars 3.55k forks source link

Rationale for this change

Currently from_buffers is not working with StringView on Python because we validate against num_buffers. This only take into account the mandatory buffers but does not take into account the variadic_spec that can be present for both string_view and binary_view

What changes are included in this PR?

Take into account whether the type contains a variadic_spec for the non-mandatory buffers and only check lower_bound number of buffers.

Are these changes tested?

Yes, I've added a couple of tests.

Are there any user-facing changes?

We are exposing a new method on the Python DataType. has_variadic_buffers which tells us whether the number of buffers expected is only lower-bounded by num_buffers.

GitHub Issue: #44651

apache / arrow