Open chebbyChefNEQ opened 1 year ago
`pa.RecordBatch.from_pylist([{"test": [1, 2, 3]}])` produces a variable-size list of int64, so it would need to be cast to a fixed-size list of float32. We do the casting for iterables. There isn't a kernel for casting a variable-size list to a fixed-size list in PyArrow, which is why the last one is erroring. And the current bug in arrow-rs means an exception raised in an RBR is a segfault.
The above script yields output:
In the first two cases, maybe we should print a warning about setting `schema=` when the data source is `pa.Table | pa.RecordBatch`, where the specified schema is ignored. In the third case, we crash on `abort` because the list of record batches and the schema mismatch. (Ideally the user would call with an RBR, but sometimes just passing a list of RecordBatches is convenient.)