AlexsLemonade / scpca-nf

scpca-nf is the Nextflow workflow for processing Single-cell Pediatric Cancer Atlas Portal data
BSD 3-Clause "New" or "Revised" License
12 stars 2 forks source link

Delete duplicate layer in AnnData objects #759

Closed jashapiro closed 4 months ago

jashapiro commented 4 months ago

When working on OpenScPCA-nf, I found that when we move raw data in the processed data, we are leaving in place the original "logcounts" layer, making it meaning we have two copies of it around.

This PR removes that duplication (and merges in changes from main to development)

Unless we were supposed to leave it for some reason, I think it should save us some space! By my metrics (and logical expectation, nice when they match) this saves ~1/3 of the space for the AnnData processed objects.

I couldn't find any reference to the layers in our docs, so this change should not require any updates there.