DESeq2 is designed to normalize for library size differences. Summing technical replicates is akin to having a higher sequencing depth for a sample.
Implementation Suggested
Encode Technical Replicate Groups in the Runsheet
Encode technical replicates as a column in the runsheet simply using integers for each technical replicate group.
Eventually, this technical replicate column should be automatically derived from ISA archive metadata; however, in the meantime, a workflow user should be able to supply a two column csv mapping sample name to technical replicate group which will be incorporated into the runsheet.
Use Technical Replicate Groups Column in Runsheet to for DESeq2 collapseReplicates
Description
Workflow should handle technical replicates appropriately.
Approaches
DESeq2 provides a collapseReplicates function that sums counts based on a factor to group samples by. The rationale has two major points:
Implementation Suggested
Encode Technical Replicate Groups in the Runsheet
Encode technical replicates as a column in the runsheet simply using integers for each technical replicate group. Eventually, this technical replicate column should be automatically derived from ISA archive metadata; however, in the meantime, a workflow user should be able to supply a two column csv mapping sample name to technical replicate group which will be incorporated into the runsheet.
Use Technical Replicate Groups Column in Runsheet to for DESeq2 collapseReplicates
https://rdrr.io/bioc/DESeq2/man/collapseReplicates.html
Validation Plan
Run the following approaches
Assessment Metrics: