cancervariants / therapy-normalization

Services and guidelines for normalizing drug and other therapy terms
https://normalize.cancervariants.org/therapy/
MIT License

Handle experimental drug names #280

Open jsstevenson opened 2 years ago

jsstevenson commented 2 years ago

Drugs are often given company- or target-specific code names, consisting of a prefix followed by an ID number. For example, AstraZeneca uses the prefix AZD for in-development therapeutics, and their COVID vaccine was originally named AZD1222. However, curators format these code names inconsistently, sometimes using a space (AZD 1222), sometimes a dash (AZD-1222), and sometimes no separator (AZD1222), and our sources may not include every possible form as an alias. We should assemble a list of such prefixes and ID structures and, if a user's query looks like it conforms to one of them, check for matches under each possible form. This behavior could look similar to the namespace inference that we currently perform for some kinds of concept IDs.
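
A rough sketch of what the variant expansion might look like, to make the idea concrete. Everything here is hypothetical and not existing code in this repo: the `KNOWN_PREFIXES` set is a tiny illustrative sample, the pattern assumes the simplest "letters, optional separator, digits" structure, and `code_name_variants` is a made-up helper name.

```python
import re

# Hypothetical, deliberately tiny sample; a real list would be curated.
KNOWN_PREFIXES = {"AZD", "BMS", "GSK"}

# Assumed structure: letters, an optional space or dash, then digits.
CODE_NAME_PATTERN = re.compile(r"^([A-Za-z]+)[ -]?(\d+)$")


def code_name_variants(query: str) -> list[str]:
    """Return each separator form of a code-name-like query, or [] if none applies."""
    match = CODE_NAME_PATTERN.match(query.strip())
    if match is None:
        return []
    prefix, number = match.group(1).upper(), match.group(2)
    if prefix not in KNOWN_PREFIXES:
        return []
    # No separator, space, and dash, in that order.
    return [f"{prefix}{sep}{number}" for sep in ("", " ", "-")]
```

With this, `code_name_variants("azd-1222")` would yield `AZD1222`, `AZD 1222`, and `AZD-1222`, and each form could then be tried against the existing lookup. Queries that don't match a known prefix return an empty list, so normal lookup behavior would be unaffected.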

jsstevenson commented 1 year ago

Wikipedia provides a good list: https://en.m.wikipedia.org/wiki/List_of_pharmaceutical_compound_number_prefixes