molbiodiv / bcdatabaser

A pipeline to create reference databases for arbitrary markers and taxonomic groups from NCBI data
https://bcdatabaser.molecular.eco
MIT License
6 stars 3 forks source link

Increase default for --seqs-per-taxon #28

Closed iimog closed 5 years ago

iimog commented 5 years ago

Experiments by Andreas Kolter indicate that using more than 3 sequences per taxon should improve classification accuracy. He recommends a default cut-off of 9 (personal communication). This default will propagate to the web interface.