NIAID-Data-Ecosystem / nde-crawlers

Harvesting infrastructure to collect and standardize dataset and computational tool metadata
Apache License 2.0
0 stars 1 forks source link

[Mapping Change]: bio.tools mapping improvement #165

Open gtsueng opened 2 months ago

gtsueng commented 2 months ago

Issue Name

bio.tools mapping improvement

Issue Description

The bio.tools parser is out of date. There is a lot of metadata that is currently available on bio.tools, but is missing from the Discovery Portal

Some fields need to be re-mapped: https://biotools.readthedocs.io/en/latest/api_usage_guide.html#tool-attributes

Issue Example

https://bio.tools/disgenet

Map Source Field

Previously, the community behind a bio.tools record was mapped to the topicCategory field. Instead, it should be mapped to the keywords field and the bio.tools topic field should be mapped to the topicCategory field. Note that bio.tools uses the EDAM topics ontology for their topic field, so we should parse it accordingly.

Additionally, bio.tools now has different types of references, and the reference type can be used to help sort how that reference should be mapped.

See: https://docs.google.com/presentation/d/17ZHG9w0eG7kMfV3sxyqz6empNohTvcBgVZzqMt7FaTE/edit#slide=id.p

Map Target Field

See: https://docs.google.com/presentation/d/17ZHG9w0eG7kMfV3sxyqz6empNohTvcBgVZzqMt7FaTE/edit#slide=id.p

Related WBS task

For internal use only. Assignee, please select the status of this issue

Status Description

No response

gtsueng commented 1 month ago

@hartwicka @lisa-mml @rshabman @sudvenk

The Ask: We seek approval to move this ComputationalTool repository to Staging so that it can be used to illustrate changes to the ComputationalTool card and resource page design (pending mock up selection and approvals)