martinfthomsen / rucs2


K-mers with non-compliant charspace are included #9

Closed martinfthomsen closed 1 year ago

martinfthomsen commented 1 year ago

Update the k-mer extraction function to ignore k-mers with invalid characters, i.e. characters outside the chosen charspace.

Characters other than ATGC, such as IUPAC codes, are currently not handled properly. As a result, k-mers containing these characters add noise to the analysis.

Please provide at least a way of ignoring these non-compliant k-mers.
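
A minimal sketch of what ignoring these k-mers could look like, in Python. This is not the existing RUCS code: the function name `extract_kmers` and the `charspace` parameter are hypothetical and used only for illustration. It counts the current run of valid characters, so any k-mer overlapping a character outside the charspace is skipped in a single pass over the sequence.

```python
def extract_kmers(seq, k, charspace="ATGC"):
    """Yield (start, kmer) for every k-mer that only uses charspace characters.

    Hypothetical helper for illustration; not part of the RUCS code base.
    """
    if k <= 0:
        raise ValueError("k must be a positive integer")
    allowed = set(charspace)
    seq = seq.upper()
    run = 0  # number of consecutive valid characters ending at position i
    for i, ch in enumerate(seq):
        # An invalid character resets the run, so no k-mer can span it.
        run = run + 1 if ch in allowed else 0
        if run >= k:
            start = i - k + 1
            yield start, seq[start:i + 1]


if __name__ == "__main__":
    # The 'N' at position 3 invalidates the k-mers starting at 1, 2 and 3.
    for start, kmer in extract_kmers("ATGNCATG", 3):
        print(start, kmer)
```

With this approach the k-mers are dropped rather than expanded, so an ambiguity code such as N or R simply leaves a gap of up to k positions in the k-mer set.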