nanoporetech / pore-c

Pore-C support
Mozilla Public License 2.0
53 stars 6 forks source link

.


We have a new bioinformatic resource that largely replaces the functionality of this project! See our new repository here: https://github.com/epi2me-labs/pore-c-py.

We recommend that you use the tool through the associated nextflow pipeline: https://github.com/epi2me-labs/wf-pore-c.

This repository is now unsupported and we do not recommend its use. Please contact Oxford Nanopore: support@nanoporetech.com for help with your application if it is not possible to upgrade to our new resources, or we are missing key features.


Pore-C tools

This package is designed to analyse the data from multi-contact pore-C reads. It is similar to the pairtools package in scope, however it is specifically designed to handle multi-contact reads (aka c-walks). It carries out the following operations:

The recommended way to run Pore-C tools is through the associated Pore-C-Snakemake pipeline. It wraps the pore-C commands and also handles some of the analysis steps outside the scope of pore-C tools such as read alignment and conversion of output files to .cool format. If you want to run the tools outside of this pipeline please see the section below on

Sample data

There are some sample datasets (fastq, alignment parquets, .pairs, .cool files) available for HindIII-digested HG002 (31Gb) here and for NlaIII-digested GM12878 (23Gb) here.

Development

The recommended way to use pore-C tools is through the associated Pore-C-Snakemake pipeline, however if you want to run tools outside of this context you'll need to set up a development environment. We use the tox automation tool with the tox-conda extension to create conda environments for development. Several environments are defined in the tox.ini file:

For each of these environments a conda environment is created at .tox/<env_id>. You can activate these for interactive work with conda activate .tox/<env_id>.

Developing the Snakemake pipeline

If you're working on the Pore-C-Snakemake pipeline:

git clone git@github.com:nanoporetech/pore-c.git
cd pore-c
tox -e snakemake --notest

cd ../
git clone git@github.com:nanoporetech/Pore-C-Snakemake.git
cd pore-c-snakemake
conda activate ../pore-c/.tox/snakemake
snakemake -j 8 test --use-conda --config pore_c_version=dev

Citation

The biorxiv pre-print describing Pore-C can be found here:

Nanopore sequencing of DNA concatemers reveals higher-order features of chromatin structure