Chromosome-level genome assembler combining multiple de novo assemblies
To install IMAP,
1. Download source
git clone https://github.com/jkimlab/IMAP.git
2. Check & install the required perl libraries
- Check the required perl libraries
./build.pl --check
- Install the required perl libraries
3. Install IMAP package
./build.pl --install
To uninstall IMAP,
./build.pl --uninstall
To install IMAP,
1. Install docker (https://docs.docker.com/install/linux/docker-ce/ubuntu)
curl -fsSL https://get.docker.com/ | sudo sh
sudo usermod -aG docker $USER # adding user to the “docker” group
2. Download source
git clone https://github.com/jkimlab/IMAP.git
3. Build image using Dockerfile
- Change to the directory where Dockerfile is located.
docker build -t [image_name] .
4. Run by docker
- Run image and create container
docker run -it [image_name] /bin/bash
Required approximately 60GB empty disk space
./build.pl --example cd IMAP_EX bash CMD
To run IMAP, you need to prepare a parameter file.
parameter file
You can use multiple sequencing read libraries, and one or more outgroup species.
############################################################
############################################################
[LIB] insertSize [(integer) insert size] insertSizeSD [(integer) SD of insert size] p1 path of forward read (ex. [path]/read1.1.fq) p2 path of reverse read (ex. [path]/read1.2.fq) p1 path of forward read (ex. [path]/read2.1.fq) p2 path of reverse read (ex. [path]/read2.2.fq)
[LIB] insertSize [(integer) insert size] insertSizeSD [(integer) SD of insert size] m1 path of forward read (ex. [path]/read3.1.fq.gz) m2 path of reverse read (ex. [path]/read3.2.fq.gz)
############################################################
############################################################
MinContigLength [(integer) minimum length of contigs]
Kmer [(integer) kmer]
MaxReadLength [(integer) maximum read length]
############################################################
############################################################
Reference [(string) name] [path of sequence file (.fa)] (ex. S288C [path]/S288C.fa) Outgroup [(string) name] [path of sequence file (.fa)] (ex. dairenensis [path]/Saccharomyces_dairenensis.fa)
TREE [path of tree (must be in newick format)] (ex. [path]/tree.nwk)
Resolution [(integer) synteny resolution]
############################################################
############################################################ IterationNumber [(integer) the number of iteration]
Then, you can use the 'IMAP' perl script.
Usage: ./IMAP.pl -t [threads] -p [parameter file] -o [out directory]
Options:
--threads|-t <integer> Number of threads (default: 1)
--params|-p <filename> Parameter file
--outdir|-o <filename> Output directory (default: ./IMAP_RESULT)
--help|-h Print usages
Simple examples:
./IMAP.pl -t 40 -p param.txt -o ./IMAP_RESULT
Song G, Lee J, Kim J, Kang S, Lee H, Kwon D, Lee D, Lang GI, Cherry JM, Kim J. Integrative Meta-Assembly Pipeline (IMAP): Chromosome-level genome assembler combining multiple de novo assemblies. PLoS One. 2019 Aug 27;14(8):e0221858. doi: 10.1371/journal.pone.0221858.
Contact
bioinfolabkr@gmail.com