clemente-lab / mmeds-meta

A database for storing and analyzing omics data
https://mmeds.org
2 stars 1 forks source link

Investigate LEfSe normalization #424

Closed adamcantor22 closed 1 year ago

adamcantor22 commented 1 year ago

Describe the bug We currently use a normalization value of 1,000,000 for running LEfSe, but it's not clear where this value comes from, other than Jose using it at one point. Need to investigate and figure out if this is the desired value.

cleme commented 1 year ago

The value was used in the original LEfSe tutorial, which does not seem to be online anymore. For now maybe keep this as a parameter with default value of 1M, but consider testing on a couple of datasets how much results differ if we set it to the current LEfSe default value.

cleme commented 1 year ago

Also, this is not a bug: that label should be reserved for issues that break the code.

adamcantor22 commented 1 year ago

image Galaxy has this as a default, recommends it when low values present