BeeCSI-Microbiome / R_analyses

Handle "unclassified X" #8

Closed by LLansing 2 years ago

LLansing commented 2 years ago

Some taxa are named in the format "unclassified X", such as "unclassified Lactobacillus". We need to determine what distinguishes these taxa from their parent taxa (e.g. how do reads classified as "unclassified Lactobacillus" differ from reads classified as the genus Lactobacillus?).

Within our framework, these reads currently go unanalyzed: the taxa have no assigned rank (such as G for genus), yet they do have reads assigned to them.

Proposed solution: drop those taxa and add their reads to the relevant parent taxa.
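A minimal sketch of the proposal, written in Python for illustration only; it is not the repository's R code. It assumes the count table can be viewed as a mapping from taxon name to per-sample read counts, and that the parent of "unclassified X" is the taxon named exactly "X". The function name `fold_unclassified`, the sample names, and the counts are all hypothetical.

```python
PREFIX = "unclassified "  # assumed naming convention: "unclassified <parent name>"


def fold_unclassified(counts):
    """Return a new table with each 'unclassified X' row added into row 'X'.

    `counts` maps taxon name -> {sample name -> read count}.
    The input table is left unmodified.
    """
    # Start from a copy of every row that is NOT an "unclassified X" row.
    folded = {
        name: dict(samples)
        for name, samples in counts.items()
        if not name.startswith(PREFIX)
    }
    # Add each "unclassified X" row into its parent row, sample by sample.
    for name, samples in counts.items():
        if not name.startswith(PREFIX):
            continue
        parent = name[len(PREFIX):]
        # If the parent row is absent, a new row is created for it.
        parent_row = folded.setdefault(parent, {})
        for sample, reads in samples.items():
            parent_row[sample] = parent_row.get(sample, 0) + reads
    return folded


# Made-up example data, not taken from the project.
counts = {
    "Lactobacillus": {"sample_1": 120, "sample_2": 80},
    "unclassified Lactobacillus": {"sample_1": 15, "sample_2": 5},
    "Gilliamella": {"sample_1": 40, "sample_2": 60},
}
folded = fold_unclassified(counts)
print(folded)
```

Matching on the name alone is the simplest assumption; if the real tables carry taxonomy IDs or lineage strings, matching on those would likely be more robust. A parent row created by this sketch would still need a rank assigned before it could be analyzed.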

LLansing commented 2 years ago

The solution should generalize easily, but these are some of the taxa seen so far

Also worth noting: these taxa may have all NA values (in the case that they have no reads assigned to them).
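The NA case matters when reads are added to a parent taxon, because a plain sum with a missing value would turn a valid parent count into a missing one (in R, `NA + 1` is `NA`). One possible way to handle it is sketched below in Python, with `None` standing in for NA. The helper name `add_reads` and the rule "missing counts as zero unless both sides are missing" are assumptions, not the project's documented behavior.

```python
def add_reads(parent_reads, child_reads):
    """Add two read counts where None stands in for NA.

    If both values are missing, the result stays missing.
    If only one is missing, it is treated as zero reads.
    """
    if parent_reads is None and child_reads is None:
        return None
    return (parent_reads or 0) + (child_reads or 0)


# A parent count combined with an all-NA "unclassified" row is unchanged.
combined = add_reads(120, None)
print(combined)
```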

LLansing commented 2 years ago

Generalized solution implemented via the function _scripts/initial_processing.R$aggregateunclassified()