fls-bioinformatics-core / auto_process_ngs

Scripts and utilities for automatic processing & management of Illumina NGS sequencing data.
Other
9 stars 6 forks source link

manage_fastqs.py: enable Fastqs to be batched into multiple ZIP files #867

Closed pjbriggs closed 1 year ago

pjbriggs commented 1 year ago

Updates the manage_fastqs.py utility to enable Fastqs to be batched into multiple ZIP files, by specifying a maximum size of each ZIP using the new --max_zip_size option.

When batching is enabled then manage_fastqs.py will create one or more ZIP archives each of which should not exceed the specified maximum size. If no limit is specified then a single ZIP archive will be created as before.

This feature is intended to allow bundling of Fastqs into chunks which can then be transferred using mechanisms that impose a size limit on individual file transfers.