ClinGen / gene-and-variant-curation-tools

ClinGen's gene and variant curation interfaces (GCI & VCI). Developed by Stanford ClinGen team.
https://curation.clinicalgenome.org/
MIT License
3 stars 1 forks source link

Google Scholar Search String #306

Open wrightmw opened 1 year ago

wrightmw commented 1 year ago

https://broadinstitute.atlassian.net/browse/CGSP-530

Based on the MANE Select transcript, provide a search string that curators can use to search for publications.

See Table 2, on page 14 in the VCI SOP for search string suggestions based on each transcript type (pasted below too): https://clinicalgenome.org/site/assets/files/7438/variant_curation_sop_v3_2_oct_2022.pdf

Screenshot 2023-01-26 at 9 53 27 PM

In the header, we would provide the ability to copy the search string to their clipboard, and, if possible, to provide a link directly to Google Scholar that would automatically take them to Google Scholar with the search string pasted in as a search.

justyneross commented 1 year ago

Here is an example of how I follow, and expand upon, this pattern for generating search terms.

For NM_000419.5:c.1063G>A (p.Glu355Lys) my search term was:

(ITGA2B OR GPIIb OR CD41) AND ("1063G>A" OR "1063G/A" OR "G1063A" OR "Glu355Lys" OR "E355K" OR "Glu324Lys" OR "E324K" OR "rs137852910" OR "44383640" OR "42461008")

It includes aliases for the gene name, two alternate ways that authors sometimes write SNVs (especially in older literature), the protein change both in the current numbering system and a previous numbering system commonly used for this protein, the rs ID, and genomic location in both GRCh 37 and 38

marinadistefano commented 1 year ago

Here is an example of the email search string from Alamut for the same variant Justyne included:

"ITGA2B" ("1063G>A" | "1063G->A" | "1063G-->A" | "1063G/A" | "Glu355Lys" | "E355K" | "rs137852910")

On Mon, Jan 30, 2023 at 4:00 PM justyneross @.***> wrote:

Here is an example of how I follow, and expand upon, this pattern for generating search terms.

For NM_000419.5:c.1063G>A (p.Glu355Lys) my search term was:

(ITGA2B OR GPIIb OR CD41) AND ("1063G>A" OR "1063G/A" OR "G1063A" OR "Glu355Lys" OR "E355K" OR "Glu324Lys" OR "E324K" OR "rs137852910" OR "44383640" OR "42461008")

It includes aliases for the gene name, two alternate ways that authors sometimes write SNVs (especially in older literature), the protein change both in the current numbering system and a previous numbering system commonly used for this protein, the rs ID, and genomic location in both GRCh 37 and 38

— Reply to this email directly, view it on GitHub https://github.com/ClinGen/gene-and-variant-curation-tools/issues/306#issuecomment-1409325430, or unsubscribe https://github.com/notifications/unsubscribe-auth/AHFKA6RI2XCH2BXJGNWPMGLWVATXPANCNFSM6AAAAAAUIKFBZE . You are receiving this because you were assigned.Message ID: @.***>

--

Marina DiStefano, Ph.D., DABMGG, FACMG | Associate Director

Clinical R&D Sequencing Platform

The Broad Institute

(Remote from Pennsylvania)

Email: @.***

she/her/hers