Quantco / datajudge

Assessing whether data from database complies with reference information.
https://datajudge.readthedocs.io
BSD 3-Clause "New" or "Revised" License
42 stars 3 forks source link

datajudge

CI Documentation Status Conda-forge PypiVersion codecov.io

Express and test specifications against data from database.

Documentation

Usage

datajudge can either be installed via pypi with pip install datajudge or via conda-forge with conda install datajudge -c conda-forge.

Please refer to the Getting Started section of our documentation for details.

Expressing an expectations between different tables from a database may look as such:

from datajudge import BetweenRequirement

companies_between_req = BetweenRequirement.from_tables(
    db_name1="example",
    table_name1="companies",
    db_name2="example",
    table_name2="companies_archive",
)

companies_between_req.add_row_superset_constraint(
    columns1=["name"], columns2=["name"], constant_max_missing_fraction=0
)