inrae / pgd-mmdt

Metadata Management Tool for Dataset
GNU General Public License v3.0
10 stars 1 forks source link

Dictionaries enhancement #6

Open AmelieMasson opened 1 month ago

AmelieMasson commented 1 month ago

Hello,

I would like to know how Maggot’s user community feeds and enhances their dictionaries, mostly the dictionary for the people. Given the high number of people working in a facility, it is not possible to fill in all the required information by hand. Therefore, how do you fill the dictionaries when a big amount of data is implied? Which methods do you use?

Question for Maggot’s team: Is it possible to connect Maggot to Orcid to fill the people fields? For instance, this functionality is available on DMP. Opidor.

Thank you in advance for you your help

djacob65 commented 1 month ago

Hi @AmelieMasson,

Currently, the best is to fill the people dictionary as the metadata related to the datasets is collected. If we want to automate this, then we should extract the data from a suitable source (e.g. internal directory).

Concerning ORCID, the idea seems good, yes but! It will depend enormously on how and with what care the information (metadata) has been filled in ORCID.

For example, by using the public ORCID API (see https://info.orcid.org/documentation/api-tutorials/api-tutorial-searching-the-orcid-registry/), I submit the following query:

q="given-names:daniel+AND+family-name:jacob&fl=orcid,given-names,family-name,email,current-institution-affiliation-name"

I get this:

orcid,given-names,family-name,email,current-institution-affiliation-name 0000-0001-8108-6552,Daniel,Jacob,, 0000-0002-5754-5445,Daniel,Jacob,,"University of Uyo Faculty of Agriculture,University of Uyo" 0009-0003-1805-7136,Daniel,Jacob,, 0000-0003-1888-0918,Daniel,Jacob,,"Albert-Ludwigs-Universität Freiburg,Universitaet Freiburg" 0000-0002-6373-3100,Daniel,Jacob,,Harvard University 0000-0002-6687-7169,Daniel,Jacob,daniel.jacob@inrae.fr,"École Nationale Supérieure de l'Électronique et de ses Applications,BIA" 0000-0002-4032-2229,Jean Daniel,Jacob,, 0009-0007-7372-2812,Daniel,Jacob da Circuncisão,,Universidade Federal de Minas Gerais

My ORCID is 0000-0002-6687-7169. Why does it put my current-institution-affiliation-name as that of my engineering school? So it's hard to work like this at the moment.

One way would be to make sure that all people in ORCID are affiliated with either a RINGGOLD ID or a correct ROR ID that unambiguously identifies institution-affiliation-name.

For example, I am associated with the RINGGOLD ID: 113907 (see my ORCID record)

when I submit the following query:

q="ringgold-org-id:113907&fl=orcid,given-names,family-name,email,current-institution-affiliation-name"

I don't appear there!! :(

I think we cannot rely on this, even if DMP OPIDOR does so!!

Daniel

djacob65 commented 1 month ago

@AmelieMasson,

Nevertheless I will see to implement it anyway despite all the reservations mentioned in the previous post. Indeed, on names less frequent than mine, it returns sometimes only one name. It's worth implementing even if it gives something once twice.

Still to be done... ;-)

AmelieMasson commented 3 weeks ago

Hi @djacob65, Thanks a lot for these indications. We found out with my colleagues that RINGGOLD isn't optimal. We will see what can we do with a suitable source.