3’ ends of RRBS reads overlapping with genomic MspI sites include non-methylated cytosines introduced through end-repair. These cytosines are not recognized by Trim Galore and are therefore not trimmed but considered during methylation calling. To avoid methylation bias we developed improve-RRBS, which identifies and hides end-repaired cytosines from methylation calling.
improve-RRBS can be installed by pip:
pip install iRRBS
python -m iRRBS.run_irrbs -i <infile> -c <chromsizes> -g <genome> -o <outputfile>
To run improve-RRBS the following input parameters are required in this order:
This guide walks you through the process of testing the iRRBS (IMPROVE-RRBS) analysis tool using a specific genomic data file. Follow these steps to ensure everything is set up and running correctly. Prerequisites
Before you begin, ensure you have the following installed:
Unzip the rn6 chromosome 20 fasta file
The rn6 (Rattus norvegicus) chromosome 20 fasta file comes compressed. Use the following command to decompress it:
gunzip rn6_chr20.fa.gz
With the data prepared, run the iRRBS tool using the command below. Test data can be found in the "test_data" folder. This command specifies the input BAM file, chromosome sizes file, genome fasta file, and the output file names.
python -m iRRBS.run_irrbs -i test_data.bam -c rn6_chr20.chrom.sizes -g rn6_chr20.fa -o test_data_out_test.bam
After running the test, a log file named test_data_out_test.log will be generated. This file contains information about the summary statistics of the analysis.
Please ensure you check the log file for any potential issues or to verify the successful completion of the test. Compare it with the test_data_out.log file from the test_data folder.