Closed bgphd closed 3 years ago
It looks like your specifying the wrong input for umi_tools
.
As it stands, you sort a file called [..]-beststratam1.bam
and create a sorted file called [...]-beststratam1.sorted.bam
, which you then index. After this, you run umi_tools
on the unsorted [..]-beststratam1.bam
file.
Switch the umi_tools
input to the sorted file and it should all be OK.
No, sorry that is not the answer. I have tried shuffling names many many times:
(base) bgold@Berts-MacBook-Pro PRE_norm_001 % ls -1
PRE_norm_001-UMIextraction-fromrawreads.log
PRE_norm_001-bowtie-genomeaftermiRNA-beststratam1.log
PRE_norm_001-bowtie-maturemiRNA-beststratam1.log
PRE_norm_001-deduplicate-genomeaftermiRNA-uniquemethod.log
PRE_norm_001-deduplicate-matureMirna-uniquemethod-beststratam1.log
PRE_norm_001-directUMIextracted-min18max30L.fastq
PRE_norm_001-directUMIextracted-readlengthfilter-cutadapt.log
PRE_norm_001-directUMIextracted.fastq
PRE_norm_001-genomeaftermiRNA-aligned-bowtie1-beststratam1.bam
PRE_norm_001-maturemiRNA-aligned-bowtie1-beststratam1.bam
PRE_norm_001-maturemiRNA-aligned-bowtie1-beststratam1.sam
PRE_norm_001-maturemiRNA-aligned-bowtie1-beststratam1.sorted.bai
PRE_norm_001-maturemiRNA-aligned-bowtie1-beststratam1.sorted.bam
PRE_norm_001-maturemiRNA-unalignedReads-bowtie1-beststratam1.fastq
PRE_norm_001-tagged.bam
PRE_norm_001_counts-finaloutput-uniquemethod-maturemiRNA.txt
PRE_norm_001_counts-uniquemethod-maturemiRNA.log
PRE_norm_001_deduplicated-genomeaftermiRNA-uniquemethod.bam
PRE_norm_001_deduplicated-matureMirna-uniquemethod-beststratam1.bam
I have also tried naming:
mv PRE_norm_001-maturemiRNA-aligned-bowtie1-beststratam1.sorted.bai PRE_norm_001-maturemiRNA-aligned-bowtie1-beststratam1.sorted.bam.bai
With no joy.
The script I am using is from this paper:
A bioinformatics approach to microRNA-sequencing analysis Pratibha Potlaa, Shabana Amanda Alic, Mohit Kapoora Osteoarthritis and Cartilage Open 3 (2021) 100131
I never ask questions until I have struggled for days...
The script from the paper is here: https://www.dropbox.com/s/2t2gqvkqsr58mjv/PotlaP_miRNA_pipeline.zip?dl=0
OK, well the error you posted is perfectly well explained by the absence of any index file for the input you specified. It may not be the complete answer to the problem you have been facing but it does explain the error posted and you provided literally no context in the original issue. It's always helpful if you give some more information about what you have tried.
I note you have spaces in the file path, but that shouldn't be an issue and a quick check manually suggests that's not the problem.
Could you please re-run umi_tools count
on the correct input, e.g the sorted BAM PRE_norm_001-maturemiRNA-aligned-bowtie1-beststratam1.sorted.bam
which has an index and include the log here.
Wow! It worked. I am supposing the spaces in the file directory names ARE the problem!
Commands in Script
Terminal Output
checking for .bam and .bai