Process and validate command line arguments

PoonLab / OpenRDP

An open-source re-implementation of the RDP4 recombination detection program

GNU General Public License v3.0

45 stars 9 forks source link

The main pre-processing steps (common to all sequences) include:

[x] Checking sequences are the same length
[x] Compressing sequences
[x] Categorizing sequences based on number of gaps and variants
[x] Calculating initial pairwise hamming distances

In the original source code, the sequences are divided into chunks of 4 nucleotides and then converted to integers.

Upon installation of the program, the pairwise hamming distance is calculated for all 625 possible combinations of 5 characters (ATGC-) and these values are written to a file. During execution of the program, these values are stored in a lookup table, queried, and used to calculate the hamming distance.

PoonLab / OpenRDP

Process and validate command line arguments #5