Rfam / rfam-production

Rfam production pipeline
Apache License 2.0
5 stars 3 forks source link

Add missing SEED sequences to fasta files? #48

Closed kalvari closed 3 years ago

kalvari commented 5 years ago

Extracted drosophila virilis sequences from the fasta files on the ftp and noticed a mismatch in the number of sequences included in the fasta files compared to the number of sequences reported on the website. The difference in the mismatch equals the number of SEED sequences, so will need to investigate further, add the seed sequences to the fasta files update the ftp

kalvari commented 3 years ago

This has been resolved with the latest update to the fasta generation code, which directly extracts SEED sequences from the Rfam.seed file