add options to just run workflow functions for just specific buckets, courses, or sessions
this should be in the form of a command-line parameter to these functions (something like raw_data_buckets, courses, sessions) such that users can run the workflow functions against a subset of the data.
This should also include an option to not clear the s3 subdirectory (so that when a job fails, for example, on a specific session, the user can just re-run against that session without running the whole job again)
add options to just run workflow functions for just specific buckets, courses, or sessions
this should be in the form of a command-line parameter to these functions (something like
raw_data_buckets
,courses
,sessions
) such that users can run the workflow functions against a subset of the data.This should also include an option to not clear the s3 subdirectory (so that when a job fails, for example, on a specific session, the user can just re-run against that session without running the whole job again)