[x] Bitcoin transaction data is available on the CS224W website, which contains every Bitcoin transaction made prior to April 7, 2013. All transactions are available on a public ledger, and this data set is a large text file containing a line for every transaction. Each line includes the transaction id, sender, recipient, value (in BTC), and a timestamp. Transactions involving multiple senders and multiple receivers are represented by multiple lines with the same transaction id.
[x] This file comes with a matching file containing one line for each group of addresses that appear together in some transaction, hence belonging to the same user or entity. We can use this file to transform our data set into a simplified one indexed by ”user” entity rather than address using the Union Find algorithm, based on the intuition that for any given transaction, only one entity could be the sender of that transaction, even if multiple sending accounts were used.
REF :
[1] Using the Bitcoin Transaction Graph to Predict the Price of Bitcoin, Alex Greaves, Benjamin Au December 8, 2015
REF : [1] Using the Bitcoin Transaction Graph to Predict the Price of Bitcoin, Alex Greaves, Benjamin Au December 8, 2015