Closed jmogarrio closed 1 year ago
@gretaabib will take a look. At a high level, we need better controls on the datasets that the sample workstream produces, ensuring they are well documented, named and tracked. May be data cards - greta will take a look.
Sprint 5
greta to work with @jjcastro and @kazob1998 to develop a holistic approach to managing all data. We will have a design doc created in Sprint 5
changed title, sprint 5 item
This was also missed in our sync with @gretaabib for the same reason, I'll follow up on it next week.
We weren't sure in triage whether this is something that is still necessary, assigning to Luis to assess.
Right now there's some descriptions of the data in comments in the main notebook, as well as some details in this doc.
Ideally, we probably want something like a Data Card for each dataset. This likely involves some collaboration with the Samples workstream, since any new dataset should come with (at least part of) a Data Card.