silversixpence-crypto / zk-proof-of-assets

MIT License
5 stars 1 forks source link

Get Ethereum anonymity set #37

Closed Stentonian closed 4 months ago

Stentonian commented 8 months ago

Find how to get this dataset, then add code to programmatically generate it on the fly

Stentonian commented 8 months ago

Found some inspo here: https://www.trmlabs.com/post/classifying-one-million-ethereum-addresses

There is this link to a google BigQuery dataset: https://bigquery.cloud.google.com/dataset/bigquery-public-data:ethereum_blockchain

But it doesn't seem to work

Stentonian commented 8 months ago

Here is another source: https://github.com/blockchain-etl/public-datasets?tab=readme-ov-filehttps://github.com/blockchain-etl/public-datasets?tab=readme-ov-file

Links to another Google BigQuery dataset: https://console.cloud.google.com/marketplace/product/ethereum/crypto-ethereum-blockchain

This one seems to work. Under the crypto_ethereum you can find the balances table, where you can use this SQL to get the top 10M addresses:

SELECT * FROM `bigquery-public-data.crypto_ethereum.balances` ORDER BY eth_balance DESC LIMIT 10000000

10M winds up being ~600MB, which you can add to your drive (without paid api access) and then download.

https://drive.google.com/drive/folders/17HESPsxv2r04vlnzA1HvsgIlSZyncAhd

Stentonian commented 8 months ago

Here is another option: https://github.com/blockchain-etl/ethereum-etl

Not sure if you can easily get top-N addresses tho

Stentonian commented 4 months ago

Not going to add the programmatic step