Currently pyarrow.parquet.core.filters_to_expression handles equality/inequality operators and in and not in operators.
I propose adding is_nan, is_null, is_valid operators where value passed would be ignored but they would
return field.is_nan() field.is_null() field.is_valid() expressions.
Describe the enhancement requested
Currently pyarrow.parquet.core.filters_to_expression handles equality/inequality operators and in and not in operators.
I propose adding is_nan, is_null, is_valid operators where value passed would be ignored but they would return field.is_nan() field.is_null() field.is_valid() expressions.
This is a very easy change but it would allow null/nan filtering in DNF form. These functions are already implemented for pyarrow.dataset.Expression https://arrow.apache.org/docs/python/generated/pyarrow.dataset.Expression.html
Component(s)
Python