Closed lcolladotor closed 7 years ago
This issue is now solved. We changed the annotation for the exon and gene counts to Gencode v25 (CHR regions). The changes are live now via https://jhubiostatistics.shinyapps.io/recount/ and can be accessed with recount
version 1.0.8 (Bioc-release) and 1.1.14 (Bioc-devel). It'll take about 36-48 hrs for the latest binaries of recount
to be available via biocLite()
.
I'm migrating the discussion on this topic to the issues page.
Original issue with POU5F1
About a week ago https://gist.github.com/ronstewart reported that POU5F1 is missing from recount in an email to me. That lead to my reply involving the following gist https://gist.github.com/lcolladotor/374a0de6be5c202bbf216295989e534a:
Actual issue
Ron replied with a comment on the gist where he is concerned that other genes might be missing in recount.
In
recount::reproduce_ranges()
we useGenomicFeatures::genes()
which by default has the argument has the argumentsingle.strand.genes.only
set toFALSE
. As seen in the documentation, this argument also drops genes that are on two different chromosomes.This explains why
POU5F1
is missing as shown below because it is present inchr6
(a canonical chr) and a few alternative chromosomes.Status
For now, we are discussing internally what we'll do. There are several options and in any case it's possible to compute the coverage matrix for genes like
POU5F1
that are present in a canonical chromosome and several non-canonical ones; these are genes researches like Ron Stewart might be interested in.Best, Leo