DistrictDataLabs / baleen

An automated ingestion service for blogs to construct a corpus for NLP research.
MIT License

Configurable Scheduling #79

Open will2041 opened 7 years ago

will2041 commented 7 years ago

Acceptance criteria:

The run frequency is currently hard-coded to once per hour: https://github.com/DistrictDataLabs/baleen/blob/master/baleen/console/commands/run.py#L51

We'll need to add a configuration option for this (config file, command-line flag, whatever) and have the run command respect it. Heck, keeping an hour as the default is OK. Knowledge of https://github.com/bbengfort/confire will help.
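
For discussion, here is a rough sketch of how the precedence could work: command-line flag first, then a config value, then the one-hour default. It uses only the standard library and is not baleen's actual code. The names `resolve_interval`, `build_parser`, the `--interval` flag, and the `run_interval_minutes` key are all made up for illustration, and the plain `dict` stands in for whatever a confire-based settings object would provide.

```python
import argparse

# Assumption: 60 minutes mirrors the currently hard-coded hourly run.
DEFAULT_INTERVAL_MINUTES = 60


def resolve_interval(cli_value=None, config=None, default=DEFAULT_INTERVAL_MINUTES):
    """Pick the run interval in minutes: CLI value, then config, then default.

    `config` is a plain dict standing in for the real settings object;
    the key name `run_interval_minutes` is hypothetical.
    """
    config = config or {}
    if cli_value is not None:
        value = cli_value
    else:
        value = config.get("run_interval_minutes", default)

    value = int(value)
    if value <= 0:
        raise ValueError("run interval must be a positive number of minutes")
    return value


def build_parser():
    """Build a parser with a hypothetical --interval flag (in minutes)."""
    parser = argparse.ArgumentParser(prog="baleen-run-sketch")
    parser.add_argument(
        "--interval",
        type=int,
        default=None,  # None means "not given", so config/default can apply
        help="minutes between ingestion runs (falls back to config, then 60)",
    )
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    # An empty dict here; a real command would pass the loaded settings.
    minutes = resolve_interval(cli_value=args.interval, config={})
    print("would schedule ingestion every %d minutes" % minutes)
```

The resolved value would then be handed to whatever scheduling call currently sits at the hard-coded line in `run.py`. Leaving the flag's default as `None` is what lets the config file and the built-in default take effect when the flag is omitted.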