Open chitralverma opened 1 year ago
You mean extending the cursor to return fetch results in Arrow format instead of the current row oriented way?
I don't know of any plans but it sounds like a good addition to Impyla.
Yes, it will be great if we could have as_pyarrow_table
and as_pyarrow_dataset
options available some where to return the results as a PyArrow Table (eagerly) or PyArrow Dataset (lazyily) which doing zero-copy.
I would second this strongly. +1
I'm not a Python guy, but I'm using this since it is the only (client/driver) I know for Impala that is stable and feature-complete. Adding arrow data format support will allow us to wrap it in different languages/systems with minimal cost.
Adding a basic implementation similar to as_pandas (https://github.com/cloudera/impyla/blob/a3d80ef353f1bd779ab81166785e40dd2100d712/impala/util.py#L46 ) seems quite simple. I see two things that could make this more complicated:
You mean extending the cursor to return fetch results in Arrow format instead of the current row oriented way?
I don't know of any plans but it sounds like a good addition to Impyla.