Closed zgb963 closed 1 month ago
Please take another look at the vignettes on the Signac website, it is not necessary or recommended to read the fragment file into R. You just need to give the path to the file on disk when creating your chromatin assay
@timoast Yes I realized soon after I posted and forgot to mention that I fixed it, thanks
Hi Signac team,
I've been trying to run the Joint RNA and ATAC analysis: 10x multiomic tutorial with my multiomics data that I processed with the 10x Genomics Cell Ranger ARC pipeline, but it's taking a very long time (hours & days) to just read in the atac__fragments.tsv.gz file, and it seems to be stuck on creating the ATAC assay and adding it to the Seurat object. For reference, the genome FASTA I used (downloaded from UCSC Genome Browser) to map the reads is from Macaca fascicularis. The genome annotation GTF (which has human hg38 gene names and macfas5 coordinates) was made with the software
Liftoff
.I was running this on an HPC system with 32 cpu cores and 12000MB of memory per core.
Right now I have 7 samples. I don't have any issues reading the count matrices for each sample.
However, when I'm reading in the atac fragments files (which are ~1-3GB each), it takes a long time to load in and sometimes it takes so long that it crashes RStudio. Is there a better function I can use to read in these files faster?
I've also tried reading in the atac fragment files in parallel using the
parallel
R package, and it still takes a long time to run. I saved the matrix files and the fragment files as RDS files to see if loading would be faster but that doesn't seem to be the case.I tried reading in just the first sample so that I can create the Seurat Object containing the RNA data and then create an ATAC assay and adding it to the object
When I get to creating the objects, it's creates the Seurat object as normal, but then it gets stuck on creating the ATAC assay so it doesn't get created and isn't a part of the Seurat object.
I'm not sure why this is happening or if there's something wrong with the atac fragments file?
This is what the atac fragments file looks like when it's loaded in. I noticed that inside the file doesn't have tab separated columns with the chromosome number, start and end position, cell barcode, and peak score. All the data is in one column. Could that be the reason why it's not loading in prooperly? Or is it just because the file is gzipped?