Note: I have also noticed that the "Volume" column has 0s for a large part of 2013. I am not sure whether this is a problem, but I have not seen this column being used anywhere.
Description of the problem
During my work with the Ransomwhere dataset, I noticed that the first transaction in the current version of the dataset is on 2012-03-09 03:31:54, which can be verified using the following Unix pipeline (here and below, we assume that the latest version of the JSON exported from the www.ransomwhe.re website is stored in the data.json file in the current directory):
However, the first entry in the BTC-USD.csv file, which is used to convert amounts in BTC to USD, is 2014-09-17. As a result, transactions before this date get an amount of 0 in USD, as can be seen here (this can be verified using something like jq '.' data.json | grep -C 10 '"amountUSD": 0$'):
In total, there were 640 transactions before September 17, 2014, which can be verified as follows:
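A minimal Python sketch of the checks described above (earliest transaction, number of transactions before the first price date, and number of transactions with a zero USD amount). It is not the pipeline used for the numbers in this description. Apart from amountUSD, the record layout is an assumption: a list of address records, each with a "transactions" list whose entries carry a Unix "time" in seconds, possibly wrapped in a top-level "result" key. All helper names are hypothetical.

```python
import json
from datetime import datetime, timezone

# First date present in BTC-USD.csv, as stated in the description.
FIRST_PRICE_DATE = datetime(2014, 9, 17, tzinfo=timezone.utc)


def iter_transactions(records):
    """Yield every transaction dict from a list of address records.

    Assumption: each record may have a "transactions" list.
    """
    for record in records:
        yield from record.get("transactions", [])


def tx_time(tx):
    """Convert the assumed Unix "time" field (seconds) to a UTC datetime."""
    return datetime.fromtimestamp(tx["time"], tz=timezone.utc)


def first_transaction_time(records):
    """Return the timestamp of the earliest transaction."""
    return min(tx_time(tx) for tx in iter_transactions(records))


def count_before(records, cutoff=FIRST_PRICE_DATE):
    """Count transactions strictly earlier than the cutoff."""
    return sum(1 for tx in iter_transactions(records) if tx_time(tx) < cutoff)


def count_zero_usd(records):
    """Count transactions whose amountUSD is 0."""
    return sum(1 for tx in iter_transactions(records) if tx.get("amountUSD") == 0)


def load_records(path="data.json"):
    """Load the export; unwrap a top-level {"result": [...]} if present (assumption)."""
    with open(path, encoding="utf-8") as fh:
        data = json.load(fh)
    return data["result"] if isinstance(data, dict) else data
```

With data.json in the current directory, the three checks would be run as first_transaction_time(load_records()), count_before(load_records()) and count_zero_usd(load_records()). Timestamps are interpreted as UTC here, which may or may not match the time zone of the value quoted above.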
Because of the missing dates, the total sum in BTC on the website is also lower than it should be:
(while the website says ₿53,179.76)
The total sum in USD is the same as on the website, though:
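The two totals can be recomputed along the same lines. In this sketch the "amount" field is assumed to hold an integer number of satoshis (1 BTC = 100,000,000 satoshis) and "amountUSD" the converted value. The "amount" field name, its unit and the helper name are assumptions that should be checked against the actual export.

```python
from decimal import Decimal

SATOSHI_PER_BTC = Decimal(100_000_000)  # 1 BTC = 10^8 satoshis


def totals(records):
    """Return (total_btc, total_usd) summed over all transactions.

    Assumptions: each record may have a "transactions" list, "amount" is an
    integer number of satoshis, and "amountUSD" is a number (0 when no price
    was available for the transaction date).
    """
    total_satoshi = 0
    total_usd = Decimal(0)
    for record in records:
        for tx in record.get("transactions", []):
            total_satoshi += int(tx.get("amount", 0))
            # Go through str() so float values do not carry binary noise into Decimal.
            total_usd += Decimal(str(tx.get("amountUSD", 0)))
    return Decimal(total_satoshi) / SATOSHI_PER_BTC, total_usd
```

Summing satoshis as integers and dividing once at the end avoids accumulating floating-point error over many transactions.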
Description of the pull request
This pull request adds the missing Bitcoin prices before September 17, 2014 to the BTC-USD.csv file used for conversion.
First, I exported the Bitcoin market price from https://coinmarketcap.com/currencies/bitcoin/historical-data/ (the first entries seemed to be the same as in the BTC-USD.csv file) as a CSV file (although a semicolon was used as the delimiter) for the period from February 24, 2012 until September 16, 2014. The format required by the current BTC-USD.csv file is the following:
The format of the exported CSV file is the following:
I used some command-line processing with awk to convert the exported CSV file (called missing_bitcoins.csv here) to the format of the BTC-USD.csv file:
The format is the same as required (note: I noticed that "Adj Close" and "Close" are the same in BTC-USD.csv, which can be verified with awk -F, '$5 != $6 { print $0; }' BTC-USD.csv, which prints only the header):
Now, the dates in the BTC-USD.csv file start from February 24, 2012:
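The conversion and the two follow-up checks can also be sketched in Python instead of awk. This is an illustration under assumptions, not the command used for this pull request. The target header Date,Open,High,Low,Close,Adj Close,Volume is inferred from the column positions mentioned above (Close in column 5, Adj Close in column 6) and the "Volume" column. The source column names (timeOpen, open, high, low, close, volume) are a guess at the exported file and will need adjusting. "Adj Close" is filled with "Close", following the observation that the two columns agree in BTC-USD.csv.

```python
import csv
import io
from datetime import datetime

# Assumed column order of BTC-USD.csv.
TARGET_HEADER = ["Date", "Open", "High", "Low", "Close", "Adj Close", "Volume"]


def convert_rows(src_lines, date_col="timeOpen",
                 price_cols=("open", "high", "low", "close"), volume_col="volume"):
    """Convert semicolon-delimited export lines to rows in the target order.

    The column names are hypothetical defaults; pass the real ones if they differ.
    """
    rows = []
    for row in csv.DictReader(src_lines, delimiter=";"):
        # Keep only the calendar date; strptime rejects anything that is not YYYY-MM-DD.
        day = datetime.strptime(row[date_col][:10], "%Y-%m-%d").date().isoformat()
        open_, high, low, close = (row[name] for name in price_cols)
        rows.append([day, open_, high, low, close, close, row[volume_col]])
    rows.sort(key=lambda r: r[0])  # oldest first; ISO dates sort lexicographically
    return rows


def to_csv(rows, with_header=False):
    """Serialize rows as comma-separated text, optionally with the target header."""
    buf = io.StringIO()
    writer = csv.writer(buf, lineterminator="\n")
    if with_header:
        writer.writerow(TARGET_HEADER)
    writer.writerows(rows)
    return buf.getvalue()


def close_mismatches(csv_lines):
    """Return data rows where Close differs from Adj Close (header is not compared)."""
    return [row for row in csv.DictReader(csv_lines) if row["Close"] != row["Adj Close"]]


def date_range(csv_lines):
    """Return (first_date, last_date) found in the Date column."""
    dates = [row["Date"] for row in csv.DictReader(csv_lines)]
    return min(dates), max(dates)
```

A typical use would read missing_bitcoins.csv with open(..., newline=""), pass the file object to convert_rows, and insert the output of to_csv between the header and the existing rows of BTC-USD.csv. Unlike the awk one-liner, close_mismatches skips the header, so an empty list means the two columns agree everywhere.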