Closed amitpande74 closed 3 years ago
Hi,
Thank you for your interest in the software.
There are a few things that could be contributing to your error:
1) There are no replicates in your experiment: As DESeq2 indicated in its error message, it is no longer able to calculate dispersion if there are no replicates in your experiment. You would need at least one replicate (either treatment or control, optimally both) to have DESeq2 work. However, you can try to take the count table file (sample_sorted_test.cntTable
) and use other DE software that could handle no-replicate experiments.
2) I noticed that you're using hg19 refseq for your gene GTF, but GRCh37 Ensembl for your TE GTF. Please note that although the genome builds are equivalent, they utilize very different nomenclature for their chromosomes (e.g. chr1
vs 1
). I noted in your log that there were no TE counted, and this difference in chromosome nomenclature could explain that. If you had mapped to hg19, we recommend using the hg19 TE GTF instead.
Also, the[E::idx_find_and_load] Could not retrieve index file for '.1617970850.172773.bam'
error can be safely ignored. See #82 for more details.
Please let us know if you have other questions.
Thanks.
Dear @olivertam , I have paired end reads (8 control replicates and 10 diseased). Tired running 1 replicate each (diseased and control). Will try to run all the replicates of diseased and control datasets together and get back to you. Thanks for the 2nd point though this was deliberate as I was too hyper to use the tool and see the output first. But thanks for mentoring and a quick response.
warm regards, Amit.
Hi Amit,
Thanks for the clarification.
Given how many samples you have, you can either run TEtranscripts
on all of them (one step, though will take some time because it will process each library in series), or run TEcount
(included with TEtranscripts
) on each of them separately (thus parallelizing), then join the output from each run together to make a combined count table, and then run differential analysis (e.g. DESeq2) independently.
The latter is better if you have lots of libraries (since you can quantify in parallel), but it would require you to manually create the combined count table (e.g. using join
), and set up your own differential analysis script (rather than have it generated by TEtranscripts
).
Just wanted to give you some options if you want to quickly look at your data.
Thanks.
Hi, I got an error my running the tool:
Kindly help.