Closed Stentonian closed 4 months ago
Found some inspo here: https://www.trmlabs.com/post/classifying-one-million-ethereum-addresses
There is this link to a google BigQuery dataset: https://bigquery.cloud.google.com/dataset/bigquery-public-data:ethereum_blockchain
But it doesn't seem to work
Here is another source: https://github.com/blockchain-etl/public-datasets?tab=readme-ov-filehttps://github.com/blockchain-etl/public-datasets?tab=readme-ov-file
Links to another Google BigQuery dataset: https://console.cloud.google.com/marketplace/product/ethereum/crypto-ethereum-blockchain
This one seems to work. Under the crypto_ethereum you can find the balances table, where you can use this SQL to get the top 10M addresses:
SELECT * FROM `bigquery-public-data.crypto_ethereum.balances` ORDER BY eth_balance DESC LIMIT 10000000
10M winds up being ~600MB, which you can add to your drive (without paid api access) and then download.
https://drive.google.com/drive/folders/17HESPsxv2r04vlnzA1HvsgIlSZyncAhd
Here is another option: https://github.com/blockchain-etl/ethereum-etl
Not sure if you can easily get top-N addresses tho
Not going to add the programmatic step
Find how to get this dataset, then add code to programmatically generate it on the fly