mda590 / cloudping.co

AWS Inter-Region Latency Monitoring
https://www.cloudping.co/
658 stars 55 forks source link

CloudPing

Records inter-region latency over a TCP connection between all AWS regions.

About this Project

Over time, as I've worked on global AWS deployments, I have often been faced with the question of which inter-region transactions will be faced with the most latency. I have been able to find a lot of static examples of previous testing completed, or anecdotal thoughts based on a region's location. I haven't been able to find any kind of dynamic, consistently updated, latency monitoring. The goal here is to provide a single source of truth for inter-region AWS region latency.

Architecture

architecture

Parts of Application

Front End

The front end of CloudPing is running in a Fargate container as a Python Flask web server. The web server pulls its data from DynamoDB and uses that data to populate the data in the table.

Region-to-Region Pings

Each active AWS region has a Lambda function that runs every 6 hours. This function does a ping of the public DynamoDB endpoint (dynamodb.<region>.amazonaws.com) and stores the RTT for the ping in to a DynamoDB table.

Averages and Percentile Calculations

Every 6 hours, after the completion of the region-to-region pings, the data is taken from the raw results DynamoDB table. The data is then used to calculate daily, weekly, monthly, and annual averages and percentiles between all of the active regions. This data is then stored in a summary DynamoDB table which is used to provide data to the front-end.

DynamoDB Tables

Deployment Instructions

The Lambda functions are deployed with AWS Chalice. The front-end web site is deployed as a Docker image, stored in ECR, and served by a Fargate service which exists behind an ALB.

TODO

Additional Notes

This project is in no way associated with Amazon or AWS. If you wish to report any issues with the project, please use the "Issues" feature within GitHub.