jessieren / DeepVirFinder

Identifying viruses from metagenomic data by deep learning

Encoding takes weeks to finish #46

Open enricorox opened 1 year ago

enricorox commented 1 year ago

Hi! I downloaded the entire training set used in the paper and concatenated the virus and prokaryote sequences into two separate FASTA files of about 76 MB and 110 GB, respectively.

The host encoding has been running for more than a week. The command I used is: python ../encode.py -i ../datasets/training/host-training.fa -l 500 -p host

However, the README says it should take about 5 minutes. Am I doing something wrong? Thanks
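
For a sense of scale, here is a minimal sketch that counts how many fragments a FASTA file would yield at a given fragment length. It assumes the encoder cuts each sequence into non-overlapping fragments of the `-l` length and drops any shorter remainder; that is an assumption for illustration, not something verified against `encode.py`. The function name `count_fragments` is hypothetical, and the demo input is a toy string rather than real data.

```python
import io


def count_fragments(handle, frag_len=500):
    """Return (n_sequences, total_bases, n_fragments) for a FASTA stream.

    Assumption: each sequence is cut into non-overlapping fragments of
    frag_len bases and the trailing remainder is discarded.
    """
    n_seqs = total_bases = n_frags = 0
    cur_len = 0
    in_record = False
    for line in handle:
        line = line.strip()
        if line.startswith(">"):
            # A new header closes the previous record, if any.
            if in_record:
                total_bases += cur_len
                n_frags += cur_len // frag_len
            n_seqs += 1
            cur_len = 0
            in_record = True
        elif in_record:
            # Sequence lines are streamed, so memory use stays constant.
            cur_len += len(line)
    # Close the final record.
    if in_record:
        total_bases += cur_len
        n_frags += cur_len // frag_len
    return n_seqs, total_bases, n_frags


# Toy input: one 8-base sequence and one 3-base sequence.
demo = io.StringIO(">a\nACGT\nACGT\n>b\nACG\n")
print(count_fragments(demo, frag_len=4))  # prints (2, 11, 2)
```

To run it on the real input, pass an open file handle instead of the toy string, for example `open("../datasets/training/host-training.fa")` with `frag_len=500`. Comparing the resulting fragment count for the 110 GB file against a small test file gives a rough idea of how far apart the two workloads are.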