Closed Niccolum closed 2 years ago
woops, hang tight @Niccolum, I updated the README too soon 😅 0.10.0, which adds support for pyspark.pandas, should be out by EOD today
hey @Niccolum pandera now supports pyspark.pandas
dataframes! https://pandera.readthedocs.io/en/stable/pyspark.html
Out of curiosity: I haven't tested validating OG pyspark dataframes, I suspect that is not supported... is this a feature that you'd want, or is converting pyspark DFs to pyspark.pandas DFs sufficient?
In the future, it may be needed, but for now it is enough)
Is your feature request related to a problem? Please describe. pandera is a great library, but it doesn't support pyspark DataFrame yet
Describe the solution you'd like Schema model for pyspark DataFrame
pip install pandera[pyspark]