Open weir12 opened 1 year ago
The problem has been solved. I mistakenly understood the meaning of -- autodownload hsa
. This option will automatically download the reference genome and annotation file of Homo sapiens, and merge the corresponding Homo sapiens reference genome and annotation file in -- gene fastas.csv -- annotation gtfs.csv
in the way of concat, which will directly double the size of the genome (two identical duplicate human genomes)
I suggest whether you can add genome de-duplication steps or warn users that when the workflow is input with two duplicate genome files before the concat genome?
Hey @weir12 thanks for reporting! Yes, that is exactly how it works. We can mention that in the README more prominent and also in the --help
message.
Hi, When I run rnaflow, the monitor caught the following error:
I checked the working directory of this step and rerun the pipeline with the same parameters. I found that the index file generated by hisat2-build is
. ht2l
, not. ht2
expected in the nf file. I found that hisat2 will adopt another index suffix for the large genome,My reference genome is Homo sapiens.”In the case of a large index these suffixes will have a ht2l termination.”
https://github.com/DaehwanKimLab/hisat2 Thanks!