Open vovavili opened 2 years ago
SchemaModel DataFrame check(s):
If duplicate columns are found, they should be documented in the err.failure_cases
@wakelt FYI the DataFrameSchema
(or SchemaModel.Config
option) has a unique
option that checks for duplicate records: https://pandera.readthedocs.io/en/stable/dataframe_schemas.html#validating-the-joint-uniqueness-of-columns
Discussed in https://github.com/pandera-dev/pandera/discussions/799