NIAID-Data-Ecosystem / nde-crawlers

Harvesting infrastructure to collect and standardize dataset and computational tool metadata
Apache License 2.0
0 stars 0 forks source link

De-duplicate / combine overlapping records for Mendeley #40

Closed flaneuse closed 1 year ago

flaneuse commented 2 years ago

Need to think carefully about how to do this, but how do we combine data that are available in multiple indices? For instance: GEO record from NCBI GEO itself, or Omics DI, or Mendeley. Ideally, this would be a single record, but need to figure out how to merge info, resolve conflicts, etc.

flaneuse commented 2 years ago

For Mendeley:

flaneuse commented 1 year ago

Mendeley doesn't expose the data they harvest from other sources, so no need to de-duplicate!