Data drop automation

Watts-Lab / commonsense-platform

The common sense platform, rate your common sense.


We want to drop data at a regular interval into the commonsense-data repo so that it serves as a continuous data release and ties into our registration paradigm.

Requirements:

extra clean, consistent, and logical naming and scrubbed tables
no PII — this data will be public from day 1.
automated verified commits from
human-readable files that can easily be diffed
files less than 100MB each
some protocol for deciding when to split into new files, e.g. every 1000 submissions or every day, whichever is more frequent.
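As a starting point for discussion, here is a minimal sketch of the split rule, assuming "whichever is more frequent" means a new file starts at each day boundary or after 1000 submissions, whichever comes first. The field names (`id`, `submitted_at`), the file naming scheme, the use of UTC days, and the choice of JSON Lines are all assumptions for illustration, not anything decided for commonsense-data.

```python
import json
from collections import defaultdict
from datetime import timezone

MAX_ROWS_PER_FILE = 1000                 # "every 1000 submissions" from the requirements
MAX_BYTES_PER_FILE = 100 * 1000 * 1000   # "less than 100MB"; decimal megabytes assumed


def plan_files(submissions, max_rows=MAX_ROWS_PER_FILE):
    """Group submissions into files, starting a new file at each UTC day
    boundary or once the current file holds max_rows rows.

    Each submission is a dict with a timezone-aware `submitted_at` datetime
    (hypothetical field name). Returns {file_name: [submission, ...]}.
    """
    files = {}
    part_by_day = defaultdict(int)  # current part index for each day
    for sub in sorted(submissions, key=lambda s: s["submitted_at"]):
        day = sub["submitted_at"].astimezone(timezone.utc).date().isoformat()
        name = f"{day}_part-{part_by_day[day]:03d}.jsonl"
        if len(files.get(name, [])) >= max_rows:
            # Current part is full: roll over to the next part for this day.
            part_by_day[day] += 1
            name = f"{day}_part-{part_by_day[day]:03d}.jsonl"
        files.setdefault(name, []).append(sub)
    return files


def to_jsonl(rows):
    """Serialize rows as one JSON object per line with sorted keys, so that
    files are human-readable and produce stable, line-level diffs."""
    lines = []
    for row in rows:
        record = dict(row, submitted_at=row["submitted_at"].isoformat())
        lines.append(json.dumps(record, sort_keys=True))
    text = "\n".join(lines) + "\n"
    if len(text.encode("utf-8")) >= MAX_BYTES_PER_FILE:
        raise ValueError("file would reach the size limit; lower max_rows")
    return text
```

One record per line with sorted keys keeps diffs small when rows are appended. PII scrubbing and the verified-commit step are deliberately left out of this sketch, since they depend on the table schema and on how the automation is authenticated.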

I'd like to discuss the details here as we start setting it up.


Data drop automation #206