Closed carolynzy closed 3 years ago
The line that throws the error has spaces not tabs:
$ awk -F"\t" 'NF!=3' reference_info.tsv | cat -A
Virus 1 NC_001802.1$
Should work ok if you change the spaces to tabs.
Thank you, Marting! That indeed is the problem. I have no idea how it got there, since other lines are in good shape.
Hi, I'm trying to prepare the reference while encountered the following issue:
I couldn't upload the references.fasta file since it's too large (3.9G) but it could be donwloaded here: https://1drv.ms/u/s!Ahu3aHGoa85BhpgJ54IOPupzotYjFA?e=MyObb2 (md5sum 0c3d0d79c6d5fa163423cfbea91917ad)
And the tsv file is the this one:
reference_info.tsv.tar.gz
I used the latest version of genome sequences as I could find and made the tsv file following to the instructions from this link:https://github.com/iqbal-lab-org/clockwork/wiki/Preparing-remove-contamination-reference-data. Did I do something wrong?Thank you in advance!