EBISPOT / goci

GWAS Catalog Ontology and Curation Infrastructure
Apache License 2.0

yaml metadata wrong - '66b35d68f8167b0001a83512' #1490

Open earlEBI opened 1 week ago

earlEBI commented 1 week ago

Some of the metadata in the yaml files for these two studies does not match the submitted data: GCST90446169 and GCST90446168

yamls: GCST90446168-yaml.txt GCST90446169-yaml.txt

E.g., the author notes differ between the two yamls and in both cases should be empty (from template). Some other fields also look wrong, including date_metadata_last_modified and adjusted_covariates, while other fields like reported trait look right.

Please investigate the cause and correct these yamls and the harmonised yamls.

earlEBI commented 3 days ago

We've been contacted by a user who has reported the same issue for their publication (39349817). This suggests the issue may be more widespread. I will let the user know we are investigating.

karatugo commented 2 days ago

Replicated the error with the following scenario:

  1. Mark GCST90429849, GCST90446168 and GCST90446169 as pending.
  2. Run the Python script generate_yaml in the conda env gss-327.
  3. Check their generated yaml files in sandbox staging to see that the author notes got mixed up.
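
For the check in step 3, a field-by-field comparison between what was submitted and what ended up in the generated yaml could be scripted roughly as below. This is only a sketch: the `find_mismatches` helper, the `STUDY_A`/`STUDY_B` accessions and the note text are made up for illustration, the field names are assumed from the fields mentioned in this issue, and loading the real yaml files and submission records is left out.

```python
from typing import Any, Dict, List, Tuple

# Assumed field names, based on the fields mentioned in this issue.
FIELDS_TO_CHECK = ("author_notes", "adjusted_covariates")

Metadata = Dict[str, Dict[str, Any]]


def find_mismatches(
    submitted: Metadata,
    generated: Metadata,
    fields: Tuple[str, ...] = FIELDS_TO_CHECK,
) -> List[Tuple[str, str, Any, Any]]:
    """Return (accession, field, submitted_value, generated_value) for each differing field."""
    mismatches = []
    for accession in sorted(submitted):
        # A study missing from the generated set is compared against an empty record.
        generated_study = generated.get(accession, {})
        for field in fields:
            expected = submitted[accession].get(field)
            actual = generated_study.get(field)
            if expected != actual:
                mismatches.append((accession, field, expected, actual))
    return mismatches


# Placeholder data only, not the real values from the affected studies.
submitted = {
    "STUDY_A": {"author_notes": None, "adjusted_covariates": None},
    "STUDY_B": {"author_notes": None, "adjusted_covariates": None},
}
generated = {
    "STUDY_A": {"author_notes": "note from another study", "adjusted_covariates": None},
    "STUDY_B": {"author_notes": None, "adjusted_covariates": None},
}

for row in find_mismatches(submitted, generated):
    print(row)
```

Running this over every study marked as pending in the same batch would show whether the mix-up is limited to author notes or also affects other fields.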

earlEBI commented 19 hours ago

Another example: submission id '66e1ce1ae6802e0001eb60ea'