AlexsLemonade / scpca-nf

scpca-nf is the Nextflow workflow for processing Single-cell Pediatric Cancer Atlas Portal data
BSD 3-Clause "New" or "Revised" License
12 stars 2 forks source link

Skip processing if no cells remain after empty droplet filtering #738

Closed allyhawkins closed 6 months ago

allyhawkins commented 6 months ago

Closes #682 Closes #735

This PR makes some adjustments to the workflow to account for any libraries that have no cells after removing empty droplets. I took a very similar approach to how we handle objects that have 0 cells after removing low-quality cells in the processed object.

While I was here I also accounted for the sample metadata issue and made sure we read in all columns as characters before adding them to the object.

Questions for reviewers:

allyhawkins commented 6 months ago

The stub workflow was failing because everything was getting flagged as having no cells since they will all be empty files. I updated this to not consider stub files when checking file size after filtering and processing.