molbiodiv / bcdatabaser

A pipeline to create reference databases for arbitrary markers and taxonomic groups from NCBI data
https://bcdatabaser.molecular.eco
MIT License
6 stars 3 forks source link

Combine filtered and raw sequences #9

Closed iimog closed 6 years ago

iimog commented 6 years ago

The pipeline currently returns an unfiltered fasta file with all (taxonomy annotated) sequences. If a primer file is provided it additionally returns a file with the filtered/cropped/oriented sequences. However, preliminary tests revealed that many sequences get lost in this step so it might be good to create a combined output file with filtered sequences if available and raw sequences for all the others.