Closed laineymajor closed 1 year ago
@laineymajor to send out communications to VFS teams about upcoming load testing.
@laineymajor drafting new MVP load testing plan and will include in this ticket
@laineymajor update with MVP plan
grafana (or jmeter) current request rate dashboar - lindsey to add
I've updated the action steps to match the new MVP load testing plan
This ticket will carryover to next sprint as we needed to give VFS teams some lead time to prep for load testing. We will be completing load testing together on March 20.
TEAMS TO TEST
What needs to be done BEFORE load testing on Monday:
Potential boards to pay attention to: Load testing dashboards:
Load testing today at 12:30PM
We had an issue with the check-in team tests. 500 errors were being returned, but we pushed a fix. Thread here
After the pushed the fix, load tests ran at 100% success.
@considerable Please type up a summary of the outcome and the issues we ran into. Thread here
Adjusted liveness probes yesterday. Requests are only going to one pod, so we need to re-run the tests. The script may be mocked and requests are for some reason going to 1 pod.
Next step: scale down and re-run tests @oseasmoran73, @considerable and Kanchana will need to re-run the scripts. We will scale down to 2 pods or decrease the targetAverageValue.
Console commands:
git clone git@github.com:department-of-veterans-affairs/vets-api-loadtest.git
cd vets-api-loadtest
docker run --rm -v `pwd`/loadtest:/loadtest -i locustio/locust:2.14.2 \
-u 40 -r 5 -t 30m --headless --only-summary -H https://staging-api.va.gov \
-f /loadtest/search/search_locust.py
At 2:25 on 3/23, the team ran the loadtest script and scaling is working
After the load test completed, pods scaled down properly as well!
Once the loadtest summary is updated, we can close this ticket @considerable
@considerable did you complete load testing on Monday with the VFS teams? Please provide a detailed summary on the work you did with the VFS teams.
@considerable to provide summary, then close ticket (before COB 3.27).
Note: Amazon blog Load testing your workload running on Amazon EKS with Locust explains load testing for the performance and reliability of a workload by generating artificial loads that mimics real-world traffic.
Reality check with actual load test ran 3/20/2023 against staging-api.va.gov:
Lesson learned:
So, what the load test said?
Has the EKS infrastructure enough capacity to run the code? - YES:
Note: Amazon recommends performing load tests to choose an automatic scaling configuration that works the way you want. See Load testing your auto-scaling configuration.
Important notes:
PROBLEM STATEMENT Load testing needs to be completed to check how systems function while a heavy volume of concurrent virtual users perform transactions over a period of time. We will run a Load Test to:
ACTION STEPS/TASKS
DEFINITION OF DONE