wcmc-its / ReCiter

ReCiter: an enterprise open source author disambiguation system for academic institutions
Apache License 2.0
46 stars 25 forks source link

Output score when any author has affiliation of an institution that commonly co-authors papers with target author's home institution #192

Closed paulalbert1 closed 6 years ago

paulalbert1 commented 6 years ago

WCM faculty are likely to co-author papers with people from certain institutions, e.g., Hospital for Special Surgery. We should score candidate articles more highly when any author has one of these affiliations. We should also try to accommodate cases where an institution does not have a Scopus license.

Current approach

Currently, this strategy is only used when a user has > 2,000 candidate articles, but there's no reason it couldn't be used for everyone.

Proposed approach

Look up records as part of a separate retrieval process. This way PubMed does the heavy lifting for us. Any returned articles get added to the esearchresult table with the commonInstitutionalAffiliationStrategy.

PubMed-only method

Used in cases where an admin does not have a Scopus license.

((new york) OR 10065 OR 10021 OR weill OR cornell OR (newyork AND presbyterian) OR (new york AND presbyterian) OR HSS OR (hospital special surgery) OR (North Shore hospital) OR (Long Island Jewish) OR (memorial sloan) OR (sloan kettering) OR sloan-kettering OR hamad OR (mount sinai) OR (methodist houston) OR (National Institute of Mental Health) OR (beth israel) OR (University of Pennsylvania Medicine) OR (Merck Research) OR (New York Medical College) OR (Medicine Dentistry New Jersey) OR Montefiore OR (Lenox Hill) OR (Cold Spring Harbor) OR (St. Luke's-Roosevelt) OR (New York University Medicine) OR Langone OR (SUNY Downstate) OR (Albert Einstein Medicine) OR Yeshiva OR UMDNJ OR Icahn Medicine OR (Mount Sinai) OR (columbia medical) OR (columbia physicians)) 

Scopus method

Use when institution does have a Scopus license. Any affiliation is one of the following AF-IDs.

paulalbert1 commented 6 years ago

Will be handled by #47