immcantation / presto

pRESTO is part of the Immcantation analysis framework for Adaptive Immune Receptor Repertoire sequencing (AIRR-seq). pRESTO is a bioinformatics toolkit for processing high-throughput lymphocyte receptor sequencing data.
https://presto.readthedocs.io
GNU Affero General Public License v3.0
0 stars 0 forks source link

Allow AssemblePairs-reference to use gapped references #46

Closed ssnn-airr closed 7 years ago

ssnn-airr commented 7 years ago

Original report by Jason Vander Heiden (Bitbucket: javh, GitHub: javh).


Should be able to ungap the reference sequences, build the usearch or blastn database, and then pass the database into the alignment tool, instead of requiring users to maintain a set of both a gapped and ungapped references.

ssnn-airr commented 7 years ago

Original comment by Jason Vander Heiden (Bitbucket: javh, GitHub: javh).


Modifications to rebuild database are done. Just need to ungap sequence upon load, test what usearch and makeblastdb do with gapped references, and pass the build through an ungapped temp file, if required.

ssnn-airr commented 7 years ago

Original comment by Jason Vander Heiden (Bitbucket: javh, GitHub: javh).


Done in 7cf3180 via reference file preprocessing step added to presto.Applications.makeUBlastDb and presto.Applications.makeBlastnDb.