Open algrebe opened 5 years ago
Timers:
lambdas.invocation.success
, lambdas.invocation.failures
Counters:
lambdas.urls.in_progress
urls.new
lambdas.invoked
store.writes
crawlq.get
, crawlq.put
Guages:
lambdas.active
urls.waiting.on_disk
, urls.waiting.in_mem
, urls.in_progress.in_lambda
, urls.completed
Issues found:
1.) Boto3 client internally maintains only 10 open connections. To avoid this limit we first created one boto client object per thread. Potential solutions: Fanout lambdas, use event invocation method, write to DynamoDB to trigger lambdas from there.
our initial implementation was wrong - we need one resource session per thread https://boto3.amazonaws.com/v1/documentation/api/latest/guide/resources.html#multithreading-multiprocessing