dssg / triage

General Purpose Risk Modeling and Prediction Toolkit for Policy and Social Good Problems

Error earlier on duplicates in label/cohort #872

Closed shaycrk closed 2 years ago

shaycrk commented 2 years ago

Currently, when duplicates are encountered in the cohort or label table, they cause a downstream error when saving predictions/evaluations (because a duplicate primary key is encountered). We should check for duplicates earlier (as soon as these tables are created) and raise an error immediately rather than proceeding to build models.
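A minimal sketch of what such an early check might look like, not triage's actual implementation. It uses an in-memory SQLite database so it runs standalone; the helper names (`find_duplicate_keys`, `assert_no_duplicates`), the table name `cohort`, and the key columns `entity_id` and `as_of_date` are assumptions made for illustration.

```python
import sqlite3


def find_duplicate_keys(conn, table, key_columns, limit=5):
    """Return up to `limit` rows of (key values..., count) for repeated keys.

    Hypothetical helper: `table` and `key_columns` are interpolated into the
    SQL text, so they must come from trusted code, never from user input.
    """
    cols = ", ".join(key_columns)
    query = (
        f"SELECT {cols}, COUNT(*) AS n FROM {table} "
        f"GROUP BY {cols} HAVING COUNT(*) > 1 "
        f"ORDER BY {cols} LIMIT ?"
    )
    return conn.execute(query, (limit,)).fetchall()


def assert_no_duplicates(conn, table, key_columns):
    """Raise ValueError right away if `table` repeats any key combination."""
    dupes = find_duplicate_keys(conn, table, key_columns)
    if dupes:
        raise ValueError(
            f"Duplicate {tuple(key_columns)} values found in '{table}'; "
            f"examples (key values followed by row count): {dupes}"
        )


# Demonstration with a made-up cohort table containing one repeated key.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE cohort (entity_id INTEGER, as_of_date TEXT)")
conn.executemany(
    "INSERT INTO cohort VALUES (?, ?)",
    [(1, "2016-01-01"), (2, "2016-01-01"), (1, "2016-01-01")],
)

try:
    # Run the check immediately after the table is built, before any modeling.
    assert_no_duplicates(conn, "cohort", ["entity_id", "as_of_date"])
except ValueError as exc:
    print(exc)
```

Running the same check on the label table right after it is created would surface the problem before any models are trained, instead of at the point where predictions or evaluations are written.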