IMPROVEMENT: split_sequence naming

biocore / microprot

structural annotation pipeline for microbial genomes and metagenomes

BSD 3-Clause "New" or "Revised" License

1 stars 6 forks source link

Closed tkosciol closed 7 years ago

tkosciol commented 7 years ago

Currently, split_sequence names outputs this way:

>NZ_GG666849.1_2 # 798 # 2885 # -1 # ID=1_2;partial=00;start_type=TTG;rbs_motif=AGxAGG/AGGxGG;rbs_spacer=5-10bp;gc_cont=0.499_1-222

i.e. residues FROM and TO are at the very end. It's acceptable, because it makes the name unique, but it wold be better to have it like:

>NZ_GG666849.1_2_1-222 # 798 # 2885 # -1 # (...)

i.e. name in the format: > genome_gene_residues # other_annotations

tkosciol commented 7 years ago

please commit changes to split_search branch