archienorman11 / thesis-bitcoin-clustering

The Bitcoin currency is a publicly available, transparent, large scale network in which every single transaction can be analysed. Multiple tools are used to extract binary information, pre-process data and train machine learning models from the decentralised blockchain. As Bitcoin popularity increases both with consumers and businesses alike, this paper looks at the threat to privacy faced by users through commercial adoption by deriving user attributes, transaction properties and inherent idioms of the network. We define the Bitcoin network protocol, describe heuristics for clustering, mine the web for publicly available user information and finally train supervised learning models. We show that two machine learning algorithms perform successfully in clustering the Bitcoin transactions based on only graphical metrics measured from the transaction network. The Logistic Regression algorithm achieves an F1 score of 0.731 and the Support Vector Machines achieves an F1 score of 0.727. This work demonstrates the value of machine learning and network analysis for business intelligence; on the other hand it also reveals the potential threats to user privacy.
36 stars 12 forks source link

Error when parsing csv transaction data #1

Open ZacharyChenpk opened 5 years ago

ZacharyChenpk commented 5 years ago

When I had done the data-preprocessing and began to run Calculate users, add graph metrics, add categorical tags.ipynb, an error occurred when I came to execute sf = gl.SFrame(transaction_data) as below:

IOError: Cannot open G:/github/thesis-bitcoin-clustering/code/graph-code/data/blocks_417500_424572/dir_archive.ini for read. Cannot open G:/github/thesis-bitcoin-clustering/code/graph-code/data/blocks_417500_424572/dir_archive.ini for reading It seemed that "dir_archive.ini" was required but I couldn't find it in the folder. Why and what should I do?

arashsa2022 commented 2 years ago

I have the same problem, do you find any solution?

ZacharyChenpk commented 2 years ago

I have the same problem, do you find any solution?

Sorry, I don't remember how I dealt with this problem as three years have passed. (Maybe I hadn't found any solution at that time lol)