ComparativeGenomicsToolkit / taffy

This is a library C/Python/CLI for working with TAF (.taf,.taf.gz) and MAF (.maf) alignment files
MIT License
23 stars 3 forks source link

Taffy

This is a MIT licensed C and Python library with a CLI for manipulating/reading/writing TAF (described below) and MAF format multiple sequence alignments. It allows conversion between the formats and manipulation of the alignments with a number of useful utilities for preparing them for different use cases. The Python library is built on top of the C library and is therefore quite fast.

Taf Format Specification

See the Taf format page for a specification of the taf format and example.

Installation

See C/CLI Install for how to build and install this source for using the C library and CLI utilities.

See Python install for how to install the Python library.

CLI Utilities

See taffy utilities for a description of the many useful taffy utilities, including:

Using the Python API

See using the Python API for how to work with MAF/TAF alignments using a convenient Python API designed to complement the CLI.

See the example notebook for a quick worked example of using the Python API for machine learning with PyTorch.

Using the C Library

There is also a simple C library for working with taf/maf files. See taf.h in the inc directory.

Comparing MAF and TAF file sizes

See quick file size comparison.