Closed junyho486 closed 2 years ago
that bug should be fixed now in v 2.1.17.
As for the Get_Data method, it takes files that are csv/tsv. just change the extension of the file and it should work.
Well, I did a clean install today, like so:
The Files do have .tsv format, I just changed to .txt for uploading them on github.
Sorry. are both bugs still there? or just the latter?
Thank you for the quick response!
I installed DeepTCR several times today trying to find the bug. Currently I am running the stable installation and see both bugs.
Edit: I reinstalled into a new env using pip3 install git+https://github.com/sidhomj/DeepTCR.git and the first bug seems to be resolved, but the second one persists.
second bug fixed. it was an issue with the expected order of columns in the files. I fixed the loading function so the order does not matter anymore. let me know if it works now!
Thank you so much! I was struggling with this one all day... Now both issues are resolved for the unsupervised and supervised model!
Ps: Also congrats for creating DeepTCR it is a very impressive tool!
TRB.txt I have TCRseq Data which was annotated by IGB and preprocessed for DeepTCR as indicated in the tutorial. I have 9 Samples with many TCRs, here is an excerpt of the Data for one Sample:
If I run the Tutorial using the example Data from the Repository for supervised Sequence Classification, loading Data, cluster etc. works perfectly (except for DTCR_SS.Train() which throws:
DTCR_SS.Monte_Carlo_CrossVal, DTCR_SS.K_Fold_CrossVal etc. work.
If I then replace the Folders in Data/Murine_Antigens with my Samples, DTCR_SS.Get_Data() which usually takes just a moment to load the data gets stuck (stopped it after 40min).
Even after only using TCRs >= 1000 Reads which results in Tables between 50-80 rows, does not resolve the issue.
Output:
Is there anything that could cause this kind of Bug?
Attached you will find the data for one Sample for TCR-seqs > 1000 (as .txt file saved .tsv)
Thank you in Advance for your help!