Closed LiuCanidk closed 1 week ago
For the duplicated protein names, you can remove these proteins from the unseen protein data file, and just use their data from the seen protein data file.
About the NA values, the most likely reason is that if there are some genes which contain all-zero values or NA values in your transcriptomic data, then certain calculation steps in SPIDER's may generate results of NA. You can remove the predicted NA values for downstream analysis.
For the duplicated protein names, you can remove these proteins from the unseen protein data file, and just use their data from the seen protein data file.
About the NA values, the most likely reason is that if there are some genes which contain all-zero values or NA values in your transcriptomic data, then certain calculation steps in SPIDER's may generate results of NA. You can remove the predicted NA values for downstream analysis.
Thanks for your reply!
I found something weird in the prediction result of SPIDER.
The output
How should I process the seen and unseen protein data if they have overlap? select one? average? I did not notice any relevant tips in the downstream tutorial.
Also, when I filter the unseen protein based on the confidence coefficient provided by SPIDER, I found some NA values in the confidence file. Of course I can just remove these NA values but I am wondering why, with no error in the process of SPIDER prediction
Any discussion would be greatly appreciated!