Closed hejing3283 closed 5 years ago
I think because your data set is only 8 samples, the test size is too small. The test size fraction is how much is used for the valid and test sets. If you have it set to 0.2, that's 1.6 samples for 2 sets which would not work. I would recommend in this case training with the LOO = 1 where 1 sample gets used to validation and one gets used for the test set. One can set this parameter in either a monte-carlo simulation or k-fold cross val. Let me know if this is was the issue and I'll write something into the code to catch when this happens and alert the user.
Thanks for the explanation. I realized it and tried with more data, each label has 8 samples, changed test for 0.5 which allows 2 samples for validation and test independently. Now I am getting a new error
err msg start-----------
Traceback (most recent call last):
File "run_deepTCR_1_main.py", line 84, in
It seems like in the output statistics, something is getting passed to the print statement that is not correct. You said each folder now has 8 csv files in each one?
Yes. I added more samples. Now each folder has 8 .csv files. The same format as before
if you send your data or a part of it to my email, i might be able to better assess the issue you are having. jsidhom1@jhmi.edu
I would also recommend trying this and seeing if it works after you load the data.
DTCR_WF.Monte_Carlo_CrossVal(folds=5,LOO=1)
Thanks for your help ahead! I am sending you 2 of the 4 directory.
On Tue, Apr 23, 2019 at 11:27 AM John-William Sidhom < notifications@github.com> wrote:
if you send your data or a part of it to my email, i might be able to better assess the issue you are having. jsidhom1@jhmi.edu
— You are receiving this because you authored the thread. Reply to this email directly, view it on GitHub https://github.com/sidhomj/DeepTCR/issues/4#issuecomment-485852530, or mute the thread https://github.com/notifications/unsubscribe-auth/AAUAIIMMKPPK6BSVVKF5GQTPR4TFXANCNFSM4HHXTAGQ .
Just tried the MCCV, similar error
err msg start---------------------------------------------------------------
Traceback (most recent call last):
File "run_deepTCR_1_main.py", line 85, in
Also, I was getting waring msgs say some of the tensorflow functions are depreciated, not sure if this is related.
warning msg start------------------------------------------------------------ WARNING:tensorflow:From /Users/jing.he1/anaconda3/envs/dl/lib/python3.7/site-packages/tensorflow/python/framework/op_def_library.py:263: colocate_with (from tensorflow.python.framework.ops) is deprecated and will be removed in a future version. Instructions for updating: Colocations handled automatically by placer. WARNING:tensorflow:From /Users/jing.he1/anaconda3/envs/dl/lib/python3.7/site-packages/DeepTCR-1.2.15-py3.7.egg/DeepTCR/functions/Layers.py:98: conv2d (from tensorflow.python.layers.convolutional) is deprecated and will be removed in a future version. Instructions for updating: Use keras.layers.conv2d instead. WARNING:tensorflow:From /Users/jing.he1/anaconda3/envs/dl/lib/python3.7/site-packages/DeepTCR-1.2.15-py3.7.egg/DeepTCR/functions/Layers.py:99: flatten (from tensorflow.python.layers.core) is deprecated and will be removed in a future version. Instructions for updating: Use keras.layers.flatten instead. WARNING:tensorflow:From /Users/jing.he1/anaconda3/envs/dl/lib/python3.7/site-packages/DeepTCR-1.2.15-py3.7.egg/DeepTCR/functions/Layers.py:102: dropout (from tensorflow.python.layers.core) is deprecated and will be removed in a future version. Instructions for updating: Use keras.layers.dropout instead. WARNING:tensorflow:From /Users/jing.he1/anaconda3/envs/dl/lib/python3.7/site-packages/DeepTCR-1.2.15-py3.7.egg/DeepTCR/DeepTCR.py:3098: dense (from tensorflow.python.layers.core) is deprecated and will be removed in a future version. Instructions for updating: Use keras.layers.dense instead. WARNING:tensorflow:From /Users/jing.he1/anaconda3/envs/dl/lib/python3.7/site-packages/tensorflow/python/ops/math_ops.py:3066: to_int32 (from tensorflow.python.ops.math_ops) is deprecated and will be removed in a future version. Instructions for updating: Use tf.cast instead. WARNING:tensorflow:From /Users/jing.he1/anaconda3/envs/dl/lib/python3.7/site-packages/tensorflow/python/ops/math_grad.py:102: div (from tensorflow.python.ops.math_ops) is deprecated and will be removed in a future version. Instructions for updating: Deprecated in favor of operator or tf.math.divide. 2019-04-23 11:32:31.648901: I tensorflow/core/platform/cpu_feature_guard.cc:141] Your CPU supports instructions that this TensorFlow binary was not compiled to use: AVX2 FMA warning msg end------------------------------------------------------------
I just ran the following code and it worked fine..
The tensorflow deprecation warnings are normal. Will eventually need to update the code for tensorflow 2.0 but for now, it should work fine.
The only difference I have is the Get_Data parameter positions. But I think it is not position sensitive.
I changed it, used the same script as you did, uninstall and install the package again, and it worked now!
Thanks so much!! Much appreciated!
Awesome! I just made some final updates. I would re-install the latest version 1.2.17.
Thanks!
Got you! 👍
I am running a testing using my own data After loading the data successfully, I got an error when training:
Load Data from directories
DTCR_WF.Get_Data(directory='data_test/', Load_Prev_Data=False, aggregate_by_aa=True, aa_column_beta=1,v_beta_column=3,d_beta_column=4,j_beta_column=5, count_column=6,n_jobs = 2, sep=",") DTCR_WF.Get_Train_Valid_Test(test_size=0.2) DTCR_WF.Train() error msg start --------------------------------------------------------------------------- ValueError Traceback (most recent call last)