10XGenomics / cellranger

10x Genomics Single Cell Analysis
https://www.10xgenomics.com/support/software/cell-ranger
Other
364 stars 94 forks source link

STAR aligner call / UMI collapsed temp data #2

Closed sklages closed 6 years ago

sklages commented 7 years ago

Hi, this is more a information request than a software issue.

I'd like to use cellranger for a custom genome reference. I know that my data will not map uniquely on that genome, I expect multiple hits of many reads. The standard cellranger count workflow will discard such reads. That would leave me without data at the end ;-)

I'd appreciate any hints ..

best, Sven

pryvkin10x commented 7 years ago

Hi Sven,

There could be a few things going on: STAR calls as unmapped those reads w/ >10 genomic alignments by default, so if you expect >10 alignments to be typical, you'll need to change outFilterMultimapNmax by adding the argument to the STAR call here:

https://github.com/10XGenomics/cellranger/blob/master/lib/python/cellranger/reference.py#L527

Once you've done that, or if the alignment number is typically <10, CR will still not consider these reads for UMI counting but it will report them in its final BAM file.

If you want to realign the data yourself (while preserving cell barcodes and UMIs) you have a few options, but unfortunately none of them are easy.

kobeho24 commented 6 years ago

One naive question, how can I kill the pipeline automatically as you mentioned above? I cannot find any fastq file under EXTRACT_READS directory after complete cellranger run.