Dataset Interfaces - Githubissues

TigerAppsOrg / pdata

A centralized data API for Princeton's campus-related data

GNU Lesser General Public License v3.0

1 stars 0 forks source link

The dataset interface defines how the main entrypoint into a dataset should look. Right now, this is in the form of a defined list of tasks, which are functions that are executed on a schedule by Celery.

Generally, the main task will just be a simple function that updates the existing data. However, the notion of general tasks allows for flexibility in the event that different types of data require different periodic maintenance. We allow the dataset itself to declare the schedule at which the tasks should be executed so that varying needs can be met; for example, courses may only need to be updated daily, but the enrollments per-course may need to be updated more frequently.

TigerAppsOrg / pdata

Dataset Interfaces #6