NIAID-Data-Ecosystem / nde-crawlers

Harvesting infrastructure to collect and standardize dataset and computational tool metadata
Apache License 2.0

[Metadata Improvement]: Fix NIAID SysBio metadata in DDE #131

Closed gtsueng closed 1 month ago

gtsueng commented 2 months ago

Issue Name

Fix NIAID SysBio metadata in DDE

Issue Description

This is more of a NIAID SysBio issue than an NDE issue, but it affects our ability to use SysBio as a compelling use case/example.

The NDE parser for data ingested from the DDE currently distinguishes between NIAID SysBio, NIAID Data Ecosystem, and other Data Discovery Engine records based on the context file. However, many batch-uploaded SysBio records do NOT have an @context object in the record. This causes them to be attributed only to the Data Discovery Engine, and NOT to NIAID SysBio.
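To illustrate the failure mode described above, here is a minimal sketch of context-based source attribution. It is not the actual nde-crawlers parser: the function name `attribute_source` and the context keys in `HYPOTHETICAL_CONTEXT_TO_SOURCE` are assumptions made for illustration, and the real rules may differ.

```python
from typing import Any, Mapping

# ASSUMPTION: these @context keys are hypothetical placeholders, not the
# keys the real NDE parser checks.
HYPOTHETICAL_CONTEXT_TO_SOURCE = {
    "niaid": "NIAID SysBio",
    "nde": "NIAID Data Ecosystem",
}

DEFAULT_SOURCE = "Data Discovery Engine"


def attribute_source(record: Mapping[str, Any]) -> str:
    """Return the source a DDE record would be attributed to.

    A record without an @context object falls through to the default,
    which is the misattribution this issue describes.
    """
    context = record.get("@context")
    if isinstance(context, Mapping):
        for key, source in HYPOTHETICAL_CONTEXT_TO_SOURCE.items():
            if key in context:
                return source
    return DEFAULT_SOURCE


# A batch-uploaded record with no @context is attributed to the DDE only.
with_context = {"@context": {"niaid": "https://example.org/ns/"}, "name": "a"}
without_context = {"name": "b"}
```

Under these assumptions, `attribute_source(without_context)` falls back to the generic DDE label, which is why adding the missing @context to the records on the DDE side resolves the attribution.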

To do:

Issue Discussion

The issue with the NIAID SysBio data was discussed at the Biweekly meeting dated 2024.04.03

Please select the type of metadata improvement

Meta URL

https://github.com/gtsueng/sysbio_dde_fixes/tree/main/data

Related WBS task

https://github.com/NIAID-Data-Ecosystem/nde-roadmap/issues/12

For internal use only. Assignee, please select the status of this issue

Status Description

No response

Request status check list

DylanWelzel commented 2 months ago

It seems that my account can only edit datasets that I uploaded. I only have 17 datasets available to edit, and I cannot overwrite the existing datasets with a bulk upload. Laura's account was the one that originally uploaded these sources: https://discovery.biothings.io/contributor/flaneuse

[Two screenshots attached, dated 2024-04-05]

gtsueng commented 2 months ago

Thanks for trying, @DylanWelzel. @newgene is there any way to confirm that these were uploaded to Laura's account and if so, is there a way for us to update them?

newgene commented 2 months ago

@gtsueng @DylanWelzel use /api/dataset/<id>?meta=1 and you should now get back a _meta field. The _meta.username field indicates the owner of the record. And yes, we can update it on the backend (can ping Marco on this).
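A rough sketch of how that ownership check could be scripted with the Python standard library. The endpoint path and the `_meta.username` field come from the comment above; the base URL (taken from the contributor link earlier in the thread), the exact shape of the JSON response, and the helper names `meta_url`, `record_owner`, and `fetch_owner` are assumptions.

```python
import json
from typing import Any, Mapping, Optional
from urllib.parse import quote
from urllib.request import urlopen

# ASSUMPTION: the API is served from the same host as the contributor page.
BASE_URL = "https://discovery.biothings.io"


def meta_url(dataset_id: str, base_url: str = BASE_URL) -> str:
    """Build the /api/dataset/<id>?meta=1 URL for one record."""
    return f"{base_url}/api/dataset/{quote(dataset_id, safe='')}?meta=1"


def record_owner(record: Mapping[str, Any]) -> Optional[str]:
    """Return _meta.username from a record, or None if it is absent."""
    meta = record.get("_meta")
    if isinstance(meta, Mapping):
        return meta.get("username")
    return None


def fetch_owner(dataset_id: str) -> Optional[str]:
    """Fetch one record over HTTP and return its owner (network call)."""
    with urlopen(meta_url(dataset_id), timeout=30) as resp:
        return record_owner(json.load(resp))
```

Looping `fetch_owner` over the IDs of the records with a missing @context would show whether they all belong to the same account.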

gtsueng commented 2 months ago

Thanks @newgene. I just checked all the records with a missing @context, and they are all tied to Laura's account. I'll get Marco's help to fix it.

gtsueng commented 2 months ago

@marcodarko, can you help with this issue?

gtsueng commented 2 months ago

As of 2024.05.02, the data has been fixed on the DDE. A new build of the data will be generated once the parser has been fixed to address https://github.com/NIAID-Data-Ecosystem/nde-portal/issues/237

gtsueng commented 1 month ago

This issue has been marked as pending close-out and will be closed after a week if there are no further comments.