biocore / microprot

structural annotation pipeline for microbial genomes and metagenomes
BSD 3-Clause "New" or "Revised" License
1 stars 6 forks source link

REQUEST: process_fasta.extract_sequences should be able to read sequence ranges #40

Closed tkosciol closed 7 years ago

tkosciol commented 7 years ago

I'd like to be able to call

process_fasta.extract_sequences('input.faa', identifiers=(1,10))

to read sequences from 1 to 10 (i.e. 10 sequences) from input.faa file.

Application I have in mind for this, is to be able to efficiently get a manageable number of sequences from a large fasta file. For example, I have a FASTA file with 10k sequences, but I only want to work on sequences 1-100 now, later I will read 101-200, etc.