Daniel-Mietchen / ideas

A dumping ground for halfbaked ideas, some of which will hopefully be worked on soon
Other
26 stars 6 forks source link

Write a piece "Towards a universal validator of identifiers used in research" #1679

Open Daniel-Mietchen opened 2 years ago

Daniel-Mietchen commented 2 years ago

Gist:

Examples I have come across:

Daniel-Mietchen commented 2 years ago

I have started a collection of cases where various identifiers have been used incorrectly: https://github.com/br2s/bug-reports-to-science/blob/master/problem-categories/identifiers/generic.md .

I have also begun to look into use cases where this might matter, e.g. A botanical demonstration of the potential of linking data using unique identifiers for people

Daniel-Mietchen commented 2 years ago

Some further examples from Functionathon: a manual data mining workflow to generate functional hypotheses for uncharacterized human proteins and its application by undergraduate students:

The tutors identified a mistake in mouse genome databases (fam90a1b wrongly annotated as a CXorf58 ortholog) and one in the literature (TMEM232 wrongly described as a tetraspan protein) that would have been difficult for the students to spot. While MGI was notified and the mistake will be corrected, mistakes in the literature are generally not corrected.