ENCODE-DCC / atac-seq-pipeline

ENCODE ATAC-seq pipeline
MIT License
389 stars 174 forks source link

Memory Saving: too many large files? #428

Open Rafaelsoler13 opened 1 year ago

Rafaelsoler13 commented 1 year ago

Hello,

I am having a problem with pipeline file sizes and memory usage. When the pipeline ends, the folder with all the files is around 600GB for 4 fastq samples of around 25GB. Is that normal? Because I have to run 32 samples, and in total it would take up almost 5TB of disk space.

What files can I delete to reduce the space it takes up? (For example the files contained in the inputs folders). Is there a way in the pipeline to reduce the number of files saved to disk?

Thank you so much,

Rafael