Arcadia-Science / prehgt

A pipeline for lightweight screening of Eukaryotic genomes and transcriptomes for recent HGT
MIT License
12 stars 6 forks source link

Add new specificity indices #27

Closed taylorreiter closed 1 year ago

taylorreiter commented 1 year ago

because donor distribution index is weird and not behaving like I thought it would. Added entropy, normalized entropy, and gini coefficient.

~I want to test this in the nextflow workflow before review and merge, but I built this PR from main. So wait until #24 is merged to main, then merge into this PR and test.~ Tested with #25 instead bc test data is no longer available for #24. Tests pass, so once #25 is merged this will be ready for review.