Closed alwhiteh closed 2 years ago
Can you show the output of:
gr <- granges(ly2[['peaks']])
gr <- gr[seqnames(gr) == "chr3L"]
max(end(gr))
Hi Tim,
I got an error with that code:
max(end(gr))
[1] -Inf
Warning message:
In max(end(gr)) : no non-missing arguments to max; returning -Inf
However, when I used "3L" instead of "chr3L" I got:
> max(end(gr))
[1] 28110291
Both my peaks and genome file should have similarly formatted chromosome names:
> head(gr)
GRanges object with 6 ranges and 0 metadata columns:
seqnames ranges strand
<Rle> <IRanges> <Rle>
[1] 3L 32221-35564 *
[2] 3L 39830-40723 *
[3] 3L 41776-42611 *
[4] 3L 57607-58043 *
[5] 3L 73603-73803 *
[6] 3L 85160-85561 *
and
> head(seqlevels(fly_genome))
[1] "2L" "2R" "3L" "3R" "4" "X"
I did, however, notice in the fly_genome object that seqinfo -> seqnames contains the "Chr2L" formatting, while user_seqnames contains the "2L" formatting. Is it possible that the command is using the former part of the genome object?
@alwhiteh are your coordinates 0-based? If so can you try the suggestions here: https://github.com/timoast/signac/issues/864
If not could you send me the granges object for your assay? You can save with the following code:
gr <- granges(ly2[['peaks']])
saveRDS(gr, "granges_signac.rds")
and email to tstuart@nygenome.org
Hi @alwhiteh
I should have realized earlier, but you indeed have peaks that extend beyond the end of chromosome 3L:
> end(fly_genome$`3L`)
[1] 28110227 1
> max(end(gr[seqnames(gr) == "3L"]))
[1] 28110291
I'd suggest double-checking the version of the genome used to map the data
Hi,
I'm working with a multiome single cell dataset from drosophila that was generated using the 10X platform. When following the multiome vignette, I realized that an R package for the dm6 ensemble genome did not exist (at least in a format that played nicely with the CreateChromatinAssay function). Therefore, I generated the annotation data using sitadela, and with some minor seqlevels formatting changes, we able to complete most of the vignette, with the exception of the RegionStats function. I get the following error:
Error in loadFUN(x, seqname, ranges) : trying to load regions beyond the boundaries of non-circular sequence "chr3L"
I'm not aware of a public dm6 dataset, but my code is attached.
I am running Mac OS 10.15.7 and my session info is: