Closed xiw588 closed 1 year ago
The only way to explain this is you get an uncomplete file. Show then console information when you run XenaDownload()
can clarify this.
For such a big dataset, I recommend adding options provided in https://cran.r-project.org/web/packages/UCSCXenaTools/vignettes/USCSXenaTools.html#how-to-resume-file-from-breakpoint.
Also you can use wget
command in terminal for downloading it.
wget -c https://tcga-pancan-atlas-hub.s3.us-east-1.amazonaws.com/download/EB%2B%2BAdjustPANCAN_IlluminaHiSeq_RNASeqV2.geneExp.xena.gz
Hi Shixiang,
Thank you so much for your help! I have a follow-up question regarding the normalized pan-cancer gene expression. Do you notice that there are some negative values? The official document says they conducted log2(RESM+1), and this should not introduce any negative values based on my understanding.
Thanks in advance!
Hi, the Xena https://xenabrowser.net/datapages/?dataset=EB++AdjustPANCAN_IlluminaHiSeq_RNASeqV2.geneExp.xena&host=https://pancanatlas.xenahubs.net says the unit is log2(norm + 1)
. So the case is due to some operations described at https://www.synapse.org/#!Synapse:syn4976363
Hi Shixiang,
I am wondering why following the r code you provided online gives the number of sample identifiers (genes) only 6000+ while the Xena official website have more than 20000? https://xenabrowser.net/datapages/?dataset=EB++AdjustPANCAN_IlluminaHiSeq_RNASeqV2.geneExp.xena&host=https://pancanatlas.xenahubs.net
Can you please clarify the difference between them? Thanks