Closed alimanfoo closed 3 years ago
Could you add an option to zip up the zarr? I think it's easier to output single (zip) files from a cromwell workflow than globbing the zarr tree structure.
Could you add an option to zip up the zarr? I think it's easier to output single (zip) files from a cromwell workflow than globbing the zarr tree structure.
Hi @gbggrant, we actually would rather not zip up the zarr, with a larger multi-sample callset like this it is much easier to work with unzipped, and we likely will copy it straight up to GCS as-is. Is there a workaround to make Cromwell happy with this kind of output?
P.S., season's greetings.
@alimanfoo I'll have another look at this. I'm doing something wrong right now, so wasn't getting anything with my glob.
Have a peaceful holiday!
This PR adds a cohort_vcf_to_zarr script for converting a multi-sample VCF file to zarr. Example usage with VCF from output of phasing pipeline:
Also includes moving the sample_vcf_to_zarr script into its own directory for consistency of file organisation.
Work towards #44.