sul-dlss / sul_pub

SUL system for harvesting and managing publications for Stanford CAP, with controlled API access.
http://cap.stanford.edu

Investigate use of quotes around names in Pubmed queries #1629

Open peetucket opened 12 months ago

peetucket commented 12 months ago

We do not currently quote the names in Pubmed queries, which can result in some incorrect matches. However, quoting the names can be too restrictive and exclude correct matches. See the two examples below.

Perhaps some more investigation is warranted, and perhaps we want to consider making the query a bit more complex, like quoting a series of name variants that we construct given the names we have (similar to what we do for the WoS query).
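As a starting point for that investigation, here is a minimal Python sketch of what a quoted name-variant query could look like. The variant rules (full name, name with middle initial, last name plus initials) and the function names `name_variants` and `quoted_variant_query` are assumptions made for illustration. They are not the rules used by the existing WoS query, and whether these variants actually improve Pubmed results has not been tested.

```python
def name_variants(first, last, middle=""):
    """Return hypothetical author-name variants, most specific first."""
    first, last, middle = first.strip(), last.strip(), middle.strip()
    if not first or not last:
        raise ValueError("both first and last name are required")

    variants = []
    if middle:
        variants.append(f"{last}, {first} {middle}")     # Casciotti, Karen Lynn
        variants.append(f"{last}, {first} {middle[0]}")  # Casciotti, Karen L
    variants.append(f"{last}, {first}")                  # Casciotti, Karen

    # Assumed short form: last name followed by initials, no comma.
    initials = first[0] + (middle[0] if middle else "")
    variants.append(f"{last} {initials}")                # Casciotti KL
    if middle:
        variants.append(f"{last} {first[0]}")            # Casciotti K

    # Drop duplicates (e.g. a one-letter middle name) while keeping order.
    return list(dict.fromkeys(variants))


def quoted_variant_query(first, last, middle="", affiliation="Stanford University"):
    """OR together the quoted variants, then AND with the affiliation."""
    names = " OR ".join(f'"{v}"[Author]' for v in name_variants(first, last, middle))
    return f"(({names})) AND ({affiliation}[Affiliation])"


print(quoted_variant_query("Karen", "Casciotti"))
```

The short initial-only variants may bring back some of the false positives that quoting is meant to remove, so each variant would need to be checked against known-good author lists before being adopted.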

Example 1: "Chan Leem"

Here is the query we currently run: ((Leem, Chan[Author])) AND (Stanford University[Affiliation])

https://pubmed.ncbi.nlm.nih.gov/?term=((Leem,%20Chan[Author]))%20AND%20(Stanford%20University[Affiliation])

It yields 1 result, which is incorrect.

Try the same query but with quotes: (("Leem, Chan"[Author])) AND (Stanford University[Affiliation])

https://pubmed.ncbi.nlm.nih.gov/?term=(("Leem, Chan"[Author])) AND (Stanford University[Affiliation])

No results (correct).

This query works better with quotes.

Example 2: "Karen Casciotti"

Here is the query we currently run: ((Casciotti, Karen[Author])) AND (Stanford University[Affiliation])

https://pubmed.ncbi.nlm.nih.gov/?term=((Casciotti%2C%20Karen%5BAuthor%5D))%20AND%20(Stanford%20University%5BAffiliation%5D)&sort=

It yields 13 results, all correct.

Try the same query but with quotes: (("Casciotti, Karen"[Author])) AND (Stanford University[Affiliation])

https://pubmed.ncbi.nlm.nih.gov/?term=(("Casciotti%2C%20Karen"%5BAuthor%5D))%20AND%20(Stanford%20University%5BAffiliation%5D)&sort=

It yields only 1 result (still correct, but far more limiting).

This query does not work better with quotes.
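To make it easier to compare the two forms across more names, the following Python sketch builds both the unquoted and quoted queries and the matching Pubmed search URLs. The helper names `author_query` and `search_url` are hypothetical and are not taken from the sul_pub code. The sketch only constructs the strings shown in the examples above and does not run the searches.

```python
from urllib.parse import quote

PUBMED_SEARCH = "https://pubmed.ncbi.nlm.nih.gov/?term="


def author_query(last, first, quoted=False, affiliation="Stanford University"):
    """Build the author/affiliation query, optionally quoting the name."""
    name = f"{last}, {first}"
    if quoted:
        name = f'"{name}"'
    return f"(({name}[Author])) AND ({affiliation}[Affiliation])"


def search_url(query):
    """Percent-encode the query for the Pubmed web search URL.

    Parentheses are left as-is to match the URLs in the examples;
    spaces, commas, brackets and double quotes are encoded.
    """
    return PUBMED_SEARCH + quote(query, safe="()")


for last, first in [("Leem", "Chan"), ("Casciotti", "Karen")]:
    for quoted in (False, True):
        query = author_query(last, first, quoted=quoted)
        print(query)
        print(search_url(query))
```

Running both forms for a sample of authors with known publication lists would show how often quoting removes false positives versus how often it drops correct results.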