RaPID is an ultra-fast tool for the identification of identity-by-descent segments among genotyped individuals. Simulation results proved that our tool (RaPID) achieves almost linear scaling up to sample size and is orders of magnitude faster than GERMLINE. At the same time, RaPID maintains a detection power and accuracy superiour or comparable to existing mainstream algorithms, GERMLINE and IBDseq. With our tool, it is feasible to identify IBDs in biobank-scale cohorts.
For technical issues and bug report, please use GitHub issues.
Authors: Shaojie Zhang shzhang@cs.ucf.edu, Degui Zhi Degui.Zhi@uth.tmc.edu.
All versions of RaPID are under the MIT License. Copyright (c) 2016-2021 The University of Texas Health Science Center at Houston and The University of Central Florida.
Minimum cut-off for RaPID v.1.7 is provided only in terms of cM. The input file for this version is compressed VCF file.
Usage:
./RaPID_v.1.7 -i <input_file_vcf_compressed> -g <genetic_mapping_file> -d <min_length_in_cM> -o <output_folder> -w <window_size> -r <#runs> -s <#success>
You can find an example of a panel with 4000 haplotypes with the chromosome length of 10 Mbps _4k1e7.vcf.gz. The file _4k_1e7e0.001.g contains the genetic mapping for the example file.
To run the program for detecting the IBDs with a minimum length of 5 cM type:
./RaPID_v.1.7 -i 4k_1e7.vcf.gz -g 4k_1e7_e0.001.g -d 5 -w 250 -r 10 -s 2 -o output_folder
The program outputs all detected IBD segments in the following format: