NIAID-Data-Ecosystem / nde-crawlers

Harvesting infrastructure to collect and standardize dataset and computational tool metadata
Apache License 2.0
0 stars 0 forks source link

[Infrastructure] Enable submission of metadata corrections #121

Closed gtsueng closed 5 months ago

gtsueng commented 6 months ago

Background: In replacing the use of PubTator with EXTRACT/Text2Term for infectiousAgent, species, and healthCondition augmentation, there may be more false positives introduced. Our infrastructure already supports the use of dictionary-based normalization of metadata for these fields; however, a correct term extraction for one abstract may be incorrect in another. For this reason, there is a need to enable record-based corrections for augmented metadata. Additionally, the pipeline may have biases which would be desirable to drop altogether.

Examples:

Additional considerations:

gtsueng commented 6 months ago

This is the repository for metadata corrections: https://github.com/NIAID-Data-Ecosystem/nde-metadata-corrections/issues