stevesong / cloud_cdn_cache

CDN and Cloud Infrastructure location data, geo-located from corporate location lists.
https://opentelecomdata.org/cdns/
Other
24 stars 2 forks source link
cache cdn cloud internet

CDNs and Cloud Infrastructure

With the aim of creating a more comprehensive picture of global cloud and CDN infrastructure, I have collected data scraped from various service provider websites into CSV and GeoJSON format. So far, I have collected information from:

For the above, I took the cache locations from the websites and geolocated them. Thus the map is only accurate to city level. The dates in the filenames indicate the last time the relevant website was checked.

For the Facebook Network Appliances, I have drawn on Anurag Bhatia's very creative work in work in locating them by the airport code in their names.

For the Netflix Open Connect Appliances, I used SecurityTrails to look for subdomains under oca.nflxvideo.net. A typical OpenConnect appliance subdomain looks like http://ipv4-c008-muc001-ix.1.oca.nflxvideo.net/. Similar to the Facebook Network Appliances, there is a three-letter airport code embedded in the url. I used the airport codes to geolocate the OCAs. There is no guarantees that all of those domains are actual live appliances but it is a place to start. I should also say that the maximum number of subdomains that SecurityTrails would list was 10,000. Thus this list is not exhaustive as there were clearly more. Other handy information in the url includes whether the appliance is located at an IXP or and ISP and whether it is IPv4 or IPv6.

This is very much an exploration so comments and suggestions are extremely welcome. Feel free to reach out to me here or on social media @stevesong@mastodon.cloud or @stevesong

All data licensed under the Open Data Commons Attribution License (ODC-By) v1.0

I am grateful to Mozilla for affording me the time to produce this map. Any errors are of course my own responsibility.