Open cosmicBboy opened 1 year ago
Hi ! Any update on this issue ? I'm using pandera to validate dataframes. I would like to deploy services with bentoml. It would be wonderful to be able to validate dataframe schema with pandera. It is a bit linked with an issue I've opened: https://github.com/bentoml/BentoML/issues/3652 Since it would at least allow to have fine validation and exceptions for inputs
Feature request
Just like a
pydantic_model
can be specified in thebentoml.io.JSON
descriptor, providing an option to specify apandera.SchemaModel
in thePandasDataFrame
descriptor would enable users to validate incoming/outgoing dataframes in their service apis with more complex, potentially custom validation checks defined in pandera SchemaModels.Something like:
Motivation
The current
PandasDataFrame
io descriptor allows fordtype
enforcement but it's up to the user to implement other statistical validation checks, like range values, allowable values, potentially more complex checks that can be expressed by pandera (see here and here.By supporting this feature, bentoml api services would be able to automatically validate these statistical properties.
Other
I'm the author of pandera 👋 and love the bentoml project!