Open shaye059 opened 3 years ago
Create one complete pipeline to run the scraping and preprocessing - getting the news articles, calculating the before and after days, getting the stock prices - and set it up as a scheduled cron task on ECS.
The entire pipeline will have to be dockerized.
Setting up scheduled tasks on ECS: https://docs.aws.amazon.com/AmazonECS/latest/developerguide/scheduled_tasks.html
Preferably this would be fault tolerant and modular so that it can make use of EC2 spot instances https://aws.amazon.com/ec2/spot/?cards.sort-by=item.additionalFields.startDateTime&cards.sort-order=asc
Create one complete pipeline to run the scraping and preprocessing - getting the news articles, calculating the before and after days, getting the stock prices - and set it up as a scheduled cron task on ECS.
The entire pipeline will have to be dockerized.
Setting up scheduled tasks on ECS: https://docs.aws.amazon.com/AmazonECS/latest/developerguide/scheduled_tasks.html