pyvideo / old-pyvideo-data

DEPRECATED: Video data for Python related conferences
Other
107 stars 38 forks source link

data quality #127

Closed willkg closed 5 years ago

willkg commented 8 years ago

We're still bootstrapping this repository and focusing on things like can we validate data? what's the workflow for fixing small issues? what's the workflow for adding new data and fixing large issues? how do we do review? what's the licensing? how do we onboard new people? what's our "service level agreement" for this data in regards to what we will and won't change and how we change it? ...

That's great. I think that constitutes "phase 1".

Phase 2 is the sorts of things we want to do long term. Long term, we want the data to improve. In order to know what data needs fixing and how good it is now, we need to figure out what factors into data quality for our project and then probably build some kind of metrics/reporting system so that we can track that over time and also surface issues that need fixing.

This issue covers that at a really high level with the expectation that this issue will spawn a bunch of smaller work-product type issues.

willkg commented 5 years ago

Closing this out since this repository is no longer active.