googlegenomics / gcp-variant-transforms

GCP Variant Transforms
Apache License 2.0
134 stars 55 forks source link

Add an option to disable sharding #679

Open arostamianfar opened 3 years ago

arostamianfar commented 3 years ago

The new version of Variant Transforms "forces" sharding. While sharding is definitely useful in some situations, it would be good to add an option to disable sharding. Example use cases:

I 'hacked' the code in our fork to make this work, but it also seems feasible to merge a version of this upstream? It's missing the residual partition (which we don't have in our use case), but it may not be too difficult to include that edge case as well.

Not sure if any other users have asked for this feature, but please let me know your thoughts and happy to discuss further :)