Closed ericmalekos closed 4 months ago
I see, there were some non-ATCGN bases in the assembly, and maybe lower case characters, I converted lower to uppercase and everything that wasnt ATCGN to N with this line and it worked
awk '/^>/ {print; next} {gsub(/[^ATCGN]/, "N"); print toupper($0)}' GCA_000001405.15_GRCh38_no_alt_analysis_set.fna > upper_GCA_000001405.15_GRCh38_no_alt_analysis_set.fna
Jacusa2 does not support non ACGTN bases. Lower case should work.
Hi, I'm runnning the following command which works fine for a while before throwing an exception. Any idea what could be causing this or what to do about it?
I used STAR to align RNASeq to the GCA_000001405.15_GRCh38 reference genome, and am using the same genome fasta here. https://ftp.ncbi.nlm.nih.gov/genomes/all/GCA/000/001/405/GCA_000001405.15_GRCh38/seqs_for_alignment_pipelines.ucsc_ids/