aws-samples / distributed-load-testing-using-aws-fargate

Solution to set up AWS Fargate to run multi-region distributed performance testing.

Multiple tasks make logging and metrics tricky #6

Open GregTurner opened 5 years ago

GregTurner commented 5 years ago

Great template; I got up and running quickly with no fuss.

One question I have, though, is about the metrics. The CloudWatch metric for Number of Concurrent Users only reports the average across all LoadTestRunner tasks. For example, if I configure 150 concurrent users and three LoadTestRunner tasks are created by default, the actual load is 450 users. Furthermore, the average response time is somewhat inaccurate because it's an average of averages.
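To illustrate the aggregation I'd expect: assuming every task publishes to the same metric (same namespace, name, and dimensions), querying the Sum statistic instead of Average would report the real total of 450. The namespace and metric name below are placeholders I made up, not necessarily what the template publishes:

```python
# Sketch only: query the concurrent-users metric with the Sum statistic
# so datapoints from all LoadTestRunner tasks in a period are added up
# rather than averaged. Namespace and metric name are hypothetical.
from datetime import datetime, timedelta, timezone

import boto3

cloudwatch = boto3.client("cloudwatch")

now = datetime.now(timezone.utc)
response = cloudwatch.get_metric_statistics(
    Namespace="LoadTesting",          # placeholder namespace
    MetricName="ConcurrentUsers",     # placeholder metric name
    StartTime=now - timedelta(minutes=15),
    EndTime=now,
    Period=60,
    Statistics=["Sum"],               # total across tasks, not the mean
)
for point in sorted(response["Datapoints"], key=lambda d: d["Timestamp"]):
    print(point["Timestamp"], point["Sum"])
```

This still doesn't fix the response-time metric, of course, since summing averages is no more meaningful than averaging them.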

Is there any way to consolidate logging between Tasks, or a way to group the metrics together?

ferdingler commented 5 years ago

Hi @GregTurner, first of all thanks for trying this project!

The way Taurus prints response times is based on averages and isn't very granular: it aggregates the requests within a given second and prints the average of them, which is what gets captured as a metric in CloudWatch. However, at the end of the test execution, Taurus prints a summary showing response times as percentiles (p50, p90, etc.), which is a lot better than looking at averages. You should see something like this in the CloudWatch Logs stream of each container:

+---------------+---------------+
| Percentile, % | Resp. Time, s |
+---------------+---------------+
|           0.0 |           0.1 |
|          50.0 |         0.405 |
|          90.0 |         0.696 |
|          95.0 |         0.797 |
|          99.0 |         1.001 |
|          99.9 |         1.805 |
|         100.0 |         7.692 |
+---------------+---------------+
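In the meantime, if you want those per-task summaries side by side, one option is to pull the percentile tables out of every container's log stream with a filter. A minimal sketch, assuming boto3 and a placeholder log group name (substitute whatever your task definition actually configures):

```python
# Sketch only: scan every log stream in the load-test log group and
# print the lines belonging to the end-of-test percentile summary,
# tagged with the stream (i.e. container) they came from.
# The log group name is a hypothetical placeholder.
import boto3

logs = boto3.client("logs")

paginator = logs.get_paginator("filter_log_events")
for page in paginator.paginate(
    logGroupName="/ecs/load-testing",  # placeholder log group name
    filterPattern='"Percentile"',      # matches the summary table header
):
    for event in page["events"]:
        print(event["logStreamName"], event["message"])
```

That filter only surfaces the header row of each table; in practice you'd widen the pattern or fetch the surrounding lines with get_log_events, but it shows the grouping idea.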

I have posted a question in the Taurus Forums to see if we can get more granular response times while the tests are being executed. This is definitely an area I want to improve in this project.

However, we need to keep in mind that the most important thing when doing performance load testing is to evaluate the behavior of your System Under Test. It's important to monitor and have metrics around the load tests themselves, but don't lose focus on what actually matters, which is monitoring your service itself. You should be learning how it responds, where the bottlenecks are, how it scales, etc.