Open hoelzer opened 2 weeks ago
Works when I do
awk '{print $1}' GCA_012799365.1_ASM1279936v1_genomic.fna > GCA_012799365.1_ASM1279936v1_genomic.fna.renamed
and use this as reference.
However, I think a user might often end up in a difference between the IDs of the ref and the BAM depending on the used mapper. It might be good to implement ID mapping always only until the first white space in the FASTA ID...
Hey,
I tried the script on my data set (which is a fragmented bacterial genome extracted from a metagenome).
but I get:
I should probably mention that I am trying on a MacBook...
I am unsure, but maybe the R script does not extract the FASTA headers correctly and then they don't match between the FASTA and BAM.