Out-of-Core hybrid Apache Arrow/NumPy DataFrame for Python, ML, visualization and exploration of big tabular data at a billion rows per second 🚀
.isin() raises pyarrow.lib.ArrowInvalid: Cannot append scalar of type string to builder for type large_string #2444
Open
myloe00 opened 2 weeks ago
While debugging, I found that this exception is raised when `vaex.expression.Expression.to_arrow` returns data with more than one chunk. I tried changing the function `vaex.expression.Expression.__arrow_array__`, and with that change the exception no longer occurs.
So, is it OK to do it this way, or is there another way to resolve this problem?