ebi-ait / hca-ebi-wrangler-central

This repo is for tracking work related to wrangling datasets for the HCA, associated tasks and for maintaining related documentation.
https://ebi-ait.github.io/hca-ebi-wrangler-central/
Apache License 2.0
7 stars 2 forks source link

hca roadmap - post meeting questions #1315

Open arschat opened 3 weeks ago

arschat commented 3 weeks ago

After our HCA Roadmap 2024 we came up with questions to specific people before we try to prioratize our tasks and goals.

  1. Tony: In core milestone 1 which costs are we aiming to reduce? Do we have a target?
  2. John: Have we reached a decision on how to lower egress costs for the Data Portal? does it include download from ENA? bonus --> will it cover lattice datasets as well
  3. John: Is there a case where we want to make metadata MA even if contributors have consent to release it as OA? Is there a minimum standard for metadata protection in HCA policies?
  4. John/Lucia: What is the expectation around tier 2 and the atlas object? Will the atlas include tier 2? Would that make the atlas MA?
  5. Jason: Is there any published cxg dataset with the full tier 1 metadata?
  6. Jason + Lucia: Is there any network close to publicate T1 metadata?
  7. Jason + Lucia: If datasets wrangled by lattice have been selected and need Tier 2 addition, how is it going to be handled? Does the lattice team have/ can have DPA?
  8. Bobbie: Can we have a meeting to discuss what the partial updates are going to address?
  9. Ida/Arsenios: How many projects from w1 & w2 are in a complete status? If a lot of them are we need a solution for updating files
arschat commented 2 weeks ago

About Q9: There are 73 projects in complete state. Out of those:

idazucchi commented 2 weeks ago

Q8: working on the partial updates is not Bobbie's priority now but she shared the notes from the last discussion that was held around changing the importer's behaviour and the scenarios are broken down in this document in theory scenario 2 should cover the case where file descriptors are presnt but the data files themselves aren't