GFF3toolkit - Python programs for processing GFF3 files

Background

The GFF3 format (Generic Feature Format Version 3) is one of the standard formats for describing and representing genomic features. It is a flexible, 9-column format that biologists can easily manipulate. This flexibility, however, also makes the format very easy to break. We developed the GFF3toolkit to help identify common problems in GFF3 files; fix 30 of these common problems; sort GFF3 files (which can help with downstream processing programs and custom parsing); merge two GFF3 files into a single, non-redundant GFF3 file; and generate FASTA files from a GFF3 file for many use cases (e.g. feature types beyond mRNA).
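To make the 9-column layout concrete, here is a minimal sketch of how a single GFF3 line can be split into its columns and given a few basic sanity checks. It is an illustration only and is not the GFF3toolkit's own code: the names `Feature` and `parse_gff3_line` are hypothetical, the column names follow the GFF3 specification, and the checks cover only a small fraction of the problems a real validator looks for.

```python
from typing import Dict, NamedTuple, Optional


class Feature(NamedTuple):
    """One GFF3 feature line; field names follow the GFF3 specification."""
    seqid: str
    source: str
    type: str
    start: int
    end: int
    score: str
    strand: str
    phase: str
    attributes: Dict[str, str]


def parse_gff3_line(line: str) -> Optional[Feature]:
    """Parse one line of a GFF3 file (hypothetical helper, not part of GFF3toolkit).

    Returns None for blank lines, comments, and directives (lines starting with '#').
    Raises ValueError when the line is not a well-formed 9-column record.
    """
    line = line.rstrip("\n")
    if not line or line.startswith("#"):
        return None

    fields = line.split("\t")
    if len(fields) != 9:
        raise ValueError(f"expected 9 tab-separated columns, found {len(fields)}")

    # Columns 4 and 5 are 1-based integer coordinates with start <= end.
    start, end = int(fields[3]), int(fields[4])
    if start > end:
        raise ValueError(f"start ({start}) is greater than end ({end})")

    # Column 9 holds semicolon-separated tag=value pairs, or '.' when empty.
    attributes: Dict[str, str] = {}
    if fields[8] != ".":
        for pair in fields[8].split(";"):
            if not pair:
                continue
            key, sep, value = pair.partition("=")
            if not sep:
                raise ValueError(f"attribute without '=': {pair!r}")
            attributes[key] = value

    return Feature(fields[0], fields[1], fields[2], start, end,
                   fields[5], fields[6], fields[7], attributes)
```

A line such as `chr1<TAB>maker<TAB>gene<TAB>100<TAB>200<TAB>.<TAB>+<TAB>.<TAB>ID=gene1` parses into a `Feature`, while a line with a missing column or with start greater than end raises `ValueError`. This shows how easily a hand-edited file can break the format.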

Frequently Asked Questions (FAQ)

Prerequisite

Installation

Stable release on PyPI

pip install gff3tool

Latest version

pip install git+https://github.com/NAL-i5K/GFF3toolkit.git

Current Functions

Usage

Detect GFF3 format errors

Correct GFF3 format errors

Merge two GFF3 files

Sort a GFF3 file

Generate biological sequences from a GFF3 file

Example Files

Internal Dependencies