-
## Description
Users express the need for data schema evaluation to enable "fail-fast" capabilities during data loading and consistency checks before execution. They highlight the potential benefits …
-
**Is your feature request related to a problem? Please describe.**
A clear and concise description of what the problem is.
In 0.16.0, [`PANDERA_VALIDATION_ENABLED`](https://pandera.readthedocs.io/en…
-
**Describe the bug**
If the categories do not coincide between the schema and the data frame, the resulting error message is confusing for the end user. Unfortunately, the cause of the error is almos…
-
What would be the best way to canonically describe a dataset, which could be read by both humans and machines?
For example, frequently in our code we have docstrings which look something like:
`…
-
## Description of issue
When using the `pandera.pyspark` module, validation of a `DataFrameSchema` that uses `Check.str_length()` in a column level check generates `NotImplementedError`.
- [x] I…
-
**Is your feature request related to a problem? Please describe.**
We have the lifecycle API. We can include more adapters, more easily now!
Here's a list of ideas:
- [ ] Profiler. One that doe…
-
## Description
``yaml`` or python are very explicit, but hard to show to managers / stakeholders / business teams. Being able to convert schema to prettier and more organized HTML documents would d…
-
**Is your feature request related to a problem? Please describe.**
Currently, trying out pandera requires installing it in some environment locally. It would be nice to be able to quickly try it ou…
-
#### Code Sample, a copy-pastable example
```python
import pandera as pa
import pandera.typing as pdt
class MySchema(pa.DataFrameModel):
int32: pdt.Series[list[pdt.Int32]] = pa.Field()
`…
-
Just some ideas, could we maybe use a python test framework for [data validation](https://en.wikipedia.org/wiki/Data_validation)? That would be nice, e.g., to generate reports. If we don't want to sto…