datasets / awesome-data

Curated list of quality open datasets
https://datahub.io/collections
747 stars 91 forks source link

Top 1 million websites in the world (Majestic Million) #320

Open rufuspollock opened 4 years ago

rufuspollock commented 4 years ago

https://blog.majestic.com/development/majestic-million-csv-daily/

The Majestic Million is a list of the top 1 million website in the world, ordered by the number of referring subnets. A subnet is a bit complex – but to a layman it is basically anything within an IP range, ignoring the last three digits of the IP number.

http://downloads.majestic.com/majestic_million.csv

License

Majestic Million CSV by Majestic 12 is licensed under a Creative Commons Attribution 3.0 Unported License.

Please – if you have made use of this data for a benevolent purpose, please mention it in the comments. We are dying to see what you use it for. We cannot promise to publish all uses and the CSV is of course provided without warranties or support unless you are on a paid plan for other API options.

risenW commented 3 years ago

I have successfully added this dataset to Datahub. Link here