gregorykucherov / mreps

mreps: software for tandem repeat identification in DNA
http://mreps.univ-mlv.fr
GNU General Public License v2.0
13 stars 5 forks source link

GFF output #9

Open tseemann opened 4 years ago

tseemann commented 4 years ago

it would be great if mreps supported GFF output, which is standard for feature annotation.

Format: https://github.com/The-Sequence-Ontology/Specifications/blob/master/gff3.md

Official Genbank tags:

Qualifier       /rpt_family=
Definition      type of repeated sequence; "Alu" or "Kpn", for example
Value format    "text"
Example         /rpt_family="Alu"

Qualifier       /rpt_type=
Definition      structure and distribution of repeated sequence
Value format    tandem, direct, inverted, flanking, nested, dispersed, terminal, 
                long_terminal_repeat, non_ltr_retrotransposon_polymeric_tract, 
                centromeric_repeat, telomeric_repeat, x_element_combinatorial_repeat,
                y_prime_element and other
Example         /rpt_type=INVERTED
Comment         the values are case-insensitive, i.e. both "INVERTED" and "inverted" 
                are valid; For the most current list of allowed values and their definitions please visit:
                http://www.insdc.org/controlled-vocabulary-rpttype-qualifier

Qualifier       /rpt_unit_range=
Definition      identity of a repeat range
Value format    <base_range>
Example         /rpt_unit_range=202..245
Comment         used to indicate the base range of the sequence that constitutes 
                a repeated sequence specified by the feature keys oriT and
                repeat_region; qualifiers /rpt_unit_range and /rpt_unit_seq
                replaced qualifier /rpt_unit in December 2005

Qualifier       /rpt_unit_seq=
Definition      identity of a repeat sequence
Value format    "text"
Example         /rpt_unit_seq="aagggc"
                /rpt_unit_seq="ag(5)tg(8)"
                /rpt_unit_seq="(AAAGA)6(AAAA)1(AAAGA)12"
Comment         used to indicate the literal sequence that constitutes a
                repeated sequence specified by the feature keys oriT and
                repeat_region; qualifiers /rpt_unit_range and /rpt_unit_seq
                replaced qualifier /rpt_unit in December 2005