sgkit-dev / bio2zarr

Convert bioinformatics file formats to Zarr
Apache License 2.0
28 stars 7 forks source link

Track missingness count for each field in ICF #131

Open jeromekelleher opened 7 months ago

jeromekelleher commented 7 months ago

Currently the summary doesn't tell us if the field is just missing values. We should be able to increment a counter for every non-missing value.

There's some subtlety here about how hard we should work to track the level of missingness within arrays that do get stored for INFO and FORMAT fields.