Open ntrpnr opened 6 years ago
Also tried with kaggle dataset - I am getting:
---------------------------------------------------------------------------
MemoryError Traceback (most recent call last)
<ipython-input-18-c25ebc6ecaf6> in <module>()
----> 1 X_train, Y_train, X_test, Y_test, Y_daybefore, unnormalized_bases, window_size = load_data("Bitcoin Data.csv", 50)
2 print (X_train.shape)
3 print (Y_train.shape)
4 print (X_test.shape)
5 print (Y_test.shape)
<ipython-input-8-c56ba76693f8> in load_data(filename, sequence_length)
36 #Normalizing data by going through each window
37 #Every value in the window is divided by the first value in the window, and then 1 is subtracted
---> 38 d0 = np.array(result)
39 dr = np.zeros_like(d0)
40 dr[:,1:,:] = d0[:,1:,:] / d0[:,0:1,:] - 1
MemoryError:
Please post url of datasource where you got csv from.
@llSourcell Could you please share the link to data source used to train this model ?
Same thing all the time with these code links in the vid - says 'its easy to find any data'. But there are tons of CSV files on Kaggle, and most of them don't work (no doubt for a reason). Siraj needs to specify a little more than, check out this code its easy to use. FYI should also have mentioned its for Python2 not python3.
All of the mentioned fields can be retrieved from (or computed using data from), a bevy of free data sources.
We'll have to do a bit more legwork to get the data formatted correctly, but perhaps to fully understand how the network configuration / preprocessing works it can be valuable to reconfigure the existing code for a custom dataset.
I can't find any CSV online either that 100% matches the specified schema, but hey, sometimes building/cleaning your own dataset can be half the fun.
I'd like to see the data too. Many we can collaborate to build that dataset unless the author can provide the code to do so. I think most of the value in this approach is from the dataset and not the modeling techniques, although RNN's are powerful for time series prediction. But right now I am much more interested in the data.
Please post a valid csv example, because it don't work... :( I will create my own dataset, but what is the correct schema?
Dataset for training the model or post the correct data schema for the same.
The tutorial dataset schema is specified in the Step 1
notebook cell -
The columns of data and their definitions are as follows:
I imagine that the model can still be trained effectively on different schemas too - but you may have to adjust the shape of the tensor depending on the number of features.
Check the code near this comment for reference -
#Convert the data to a 3D array (a x b x c)
#Where a is the number of days, b is the window size, and c is the number of features in the data file
where are Dataset?
url of dataset please.
please provide the datasets, thanks
yeah as above....can you please provide dataset? thank you.
Note you can use any dataset quite easily in his code, it is mostly generic. The main part you'd have to be aware of is the index of the BTC prices. They seem to be 20 in his code - look where he gets the labels in y_train. So if you change that index to match the index in your own data the rest of the code should work with whatever dataset you want.
TypeError Traceback (most recent call last)
@simonhughes22 I agree with you. I test the codes with a 4 features dataset from Kaggle, they work. However, other than some codes questions, I am wondering how many days of those codes can predict and how can you see them? Will be appreciated if I can get your take on this.
hi @TingALin, which dataset did u use? this https://www.kaggle.com/mczielinski/bitcoin-historical-data/data ?
@calvinchankf yes, but only with open, close, high, low as the features for testing
i used this dataset which include all features that Raval was talking about. But my statistic is following: Precision: 0.5376884422110553 Recall: 0.5783783783783784 F1 score: 0.5572916666666665 Mean Squared Error: 0.217757766929
@calvinchankf do you know how to print out the predicted price btw? cause I don't see the predicted price from the code.
@TingALin no i gave up trying this sample cos i think using future features(bi-direction) is kind of unrealistic to predict future prices. so i ended up studying other samples
In the referenced video, we are told that you can find a dataset on Kaggle. The datasets which can be found there does not contain all the columns that this notebook requires.
So where can we find a dataset that can be used together with this model?