a tool for annotation of variable (V), diversity (D), and joining (J) immunoglobulin genes in genomes.
IGDetective takes a genome in FASTA format as an input and operates in three stages:
To run IGDetective, type:
python run_iterative_igdetective.py genome.fasta output_dir
Please note that IGDetective overwrites the output directory, so make sure that it does not contain important files.
IGDetective reports three txt files containing information about detected V, D, and J genes from three IG loci: IGH, IGK, and IGL:
output_dir/combined_genes_IGH.txt
output_dir/combined_genes_IGK.txt
output_dir/combined_genes_IGL.txt
Each file represents a table describing detected genes using the following columns:
Examples of the txt files can be found in examples/mDicBic1_igdetective.
The summary plot output_dir/summary.png
illustrates counts of the detected genes across all IG loci and contigs. An example of the summary plot for the black rhinoceros genome (mDicBic1, the maternal assembly) is shown below:
The subdirectory output_dir/position_plots
contains plots showing positions of IG genes found on the same contig. The position plot for the black rhinoceros IGH locus located on the contig SUPER_24
is shown below:
Each bar represents an IG gene, colors distinguish genes of different types:
In this example, the IGH locus is located on the reverse-complementary strand and has the J-D-V ordering of IG genes. Positions of IG genes are shown within a range from the leftmost IG gene to the rightmost IG gene.
We are actively working to make IGDetective better. Please report any bugs to GitHub.
We also welcome your comments and suggestions on IGDetective. Please feel free to send it to Vikram Sirupurapu (vsirupur@ucsd.edu) and/or Yana Safonova (ysafonova@cs.jhu.edu).