wcmc-its / ReCiter

ReCiter: an enterprise open source author disambiguation system for academic institutions
Apache License 2.0

Don't double-count institutions in institutionalScoringStrategy #360

Closed paulalbert1 closed 5 years ago

paulalbert1 commented 5 years ago

For the targetAuthorScore institution match, we should count a match from only one source: Scopus if available, with PubMed as a backup. Currently, we count both Scopus and PubMed as matches.

If there is a POSITIVE_MATCH for Scopus, continue to output the PubMed results, BUT set the PubMed match score to 0.

e.g., ajg9004, 30674375

                "affiliationEvidence": {
                    "scopusTargetAuthorAffiliation": [
                        {
                            "targetAuthorInstitutionalAffiliationSource": "SCOPUS",
                            "targetAuthorInstitutionalAffiliationIdentity": "Johns Hopkins University",
                            "targetAuthorInstitutionalAffiliationArticleScopusLabel": "Johns Hopkins University",
                            "targetAuthorInstitutionalAffiliationArticleScopusAffiliationId": 60005248,
                            "targetAuthorInstitutionalAffiliationMatchType": "POSITIVE_MATCH_INDIVIDUAL",
                            "targetAuthorInstitutionalAffiliationMatchTypeScore": 3
                        },
                        {
                            "targetAuthorInstitutionalAffiliationSource": "SCOPUS",
                            "targetAuthorInstitutionalAffiliationIdentity": "The Johns Hopkins School of Medicine",
                            "targetAuthorInstitutionalAffiliationArticleScopusLabel": "The Johns Hopkins School of Medicine",
                            "targetAuthorInstitutionalAffiliationArticleScopusAffiliationId": 60001117,
                            "targetAuthorInstitutionalAffiliationMatchType": "POSITIVE_MATCH_INDIVIDUAL",
                            "targetAuthorInstitutionalAffiliationMatchTypeScore": 3
                        }
                    ],
                    "pubmedTargetAuthorAffiliation": {
                        "targetAuthorInstitutionalAffiliationSource": "PUBMED",
                        "targetAuthorInstitutionalAffiliationIdentity": "The Johns Hopkins University School of Medicine",
                        "targetAuthorInstitutionalAffiliationArticlePubmedLabel": "Johns Hopkins UniversityByramjee Jeejeebhoy Medical College Clinical Trials Unit Pune India Johns Hopkins School Medicine Baltimore Maryland.",
                        "targetAuthorInstitutionalAffiliationMatchType": "POSITIVE_MATCH_INDIVIDUAL",
                        "targetAuthorInstitutionalAffiliationMatchTypeScore": 3
                    },
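
The rule proposed above can be sketched roughly as follows. This is a minimal Python illustration, not ReCiter's actual Java implementation: the function name `apply_single_source_rule` and the prefix check on `POSITIVE_MATCH` are assumptions made for the example, while the field names are taken from the JSON above.

```python
import copy

# Field names as they appear in the affiliationEvidence JSON above.
SCORE_KEY = "targetAuthorInstitutionalAffiliationMatchTypeScore"
TYPE_KEY = "targetAuthorInstitutionalAffiliationMatchType"


def apply_single_source_rule(affiliation_evidence):
    """Return a copy of the evidence where PubMed scores 0 if Scopus already matched.

    Hypothetical helper: the PubMed entry is still output, only its score changes.
    """
    evidence = copy.deepcopy(affiliation_evidence)
    scopus = evidence.get("scopusTargetAuthorAffiliation") or []
    pubmed = evidence.get("pubmedTargetAuthorAffiliation")

    # Assumption: any match type starting with "POSITIVE_MATCH" counts as positive.
    scopus_matched = any(
        str(entry.get(TYPE_KEY, "")).startswith("POSITIVE_MATCH") for entry in scopus
    )

    if scopus_matched and pubmed is not None:
        pubmed[SCORE_KEY] = 0  # keep the PubMed result, but do not count it twice
    return evidence


# Abbreviated version of the ajg9004 / 30674375 evidence shown above.
example = {
    "scopusTargetAuthorAffiliation": [
        {TYPE_KEY: "POSITIVE_MATCH_INDIVIDUAL", SCORE_KEY: 3},
        {TYPE_KEY: "POSITIVE_MATCH_INDIVIDUAL", SCORE_KEY: 3},
    ],
    "pubmedTargetAuthorAffiliation": {
        TYPE_KEY: "POSITIVE_MATCH_INDIVIDUAL",
        SCORE_KEY: 3,
    },
}
adjusted = apply_single_source_rule(example)
```

With this sketch, the PubMed entry in `adjusted` keeps its match type but carries a score of 0, and when no Scopus entry is a positive match the PubMed score is left unchanged so that PubMed can act as the backup source.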
paulalbert1 commented 5 years ago

By all appearances, this is fixed.