liulab-dfci / MAESTRO

Single-cell Transcriptome and Regulome Analysis Pipeline
GNU General Public License v3.0
278 stars 78 forks source link

Support for gzipped FASTQ input in `sc_atac_preprocess` #171

Open Billsfriend opened 1 month ago

Billsfriend commented 1 month ago

Hello,

I’ve noticed that during the sc_atac_preprocess step, all FASTQ files in .fastq.gz format are being decompressed into a new location, which consumes a significant amount of disk space. However, since Chromap supports gzipped input, it would be more efficient to retain the FASTQ files in their compressed format.

Could we implement a way to process fastq.gz files directly without the need for decompression? This adjustment would greatly optimize storage requirements.