amrisi / amr-guidelines


Slang drugs/ substances: NE vs common nouns #69

Closed mgeorgescu closed 10 years ago

mgeorgescu commented 11 years ago

DF data contains different names of drugs:

e.g. LSD, meth, crystal, maryjane

  1. Should drug-related substances be treated as common nouns or NEs?

We would opt for treating them as common nouns.

If we need to treat them as NEs, the concept for the NE should be "drug" or "substance", even though neither is on the NE list. Or should we opt for "thing"?

:ARG1 (d / drug
              :name (n / name
                      :op1 "LSD"))
  2. The slang terms should not be mapped to standard form: maryjane > marijuana, meth > methamphetamine

Note: On issue #68, the approach was to map certain slang expressions to standard form: 'em > they, wanna > want

So point 2 could be contradictory to the approach on #68

Any feedback is appreciated. Thank you.

uhermjakob commented 11 years ago

We treat substances as "common nouns", not NEs, and I think that should apply to drugs as well. Named entities (NEs) are individuals, such as a specific animal ("Lassie"), as opposed to a class of entities ("dog"). Consensus substance examples: gold, water, milk.

Remember, to enter upper-case concepts such as LSD in the AMR Editor, enter !LSD (with a !) to override the AMR Editor's default assumption that upper-case items are strings (and not concepts).

I agree that we currently don't normalize synonyms such as methamphetamine, meth, crystal, desoxyephedrine to a canonical form. We do correct typos and expand true abbreviations such as "Dr." -> doctor and "Mass." -> "Massachusetts", but we keep acronyms such as CEO and LSD and we keep clipped forms such as meth, phone, exam, flu, fax, gym instead of expanding to methamphetamine, telephone, examination, influenza, facsimile, gymnasium.

nschneid commented 11 years ago

A pharmaceutical brand name like Tylenol would be product, correct? I guess it makes sense to distinguish those from general kinds of drugs, like aspirin.