gauthamsunjay / ariados

A scalable web crawling framework using lambdas
0 stars 0 forks source link

Evaluation - Stage A #16

Open algrebe opened 5 years ago

algrebe commented 5 years ago
gauthamsunjay commented 5 years ago
gauthamsunjay commented 5 years ago

Issues found:

1.) Boto3 client internally maintains only 10 open connections. To avoid this limit we first created one boto client object per thread. Potential solutions: Fanout lambdas, use event invocation method, write to DynamoDB to trigger lambdas from there.

algrebe commented 5 years ago

our initial implementation was wrong - we need one resource session per thread https://boto3.amazonaws.com/v1/documentation/api/latest/guide/resources.html#multithreading-multiprocessing