propbank / propbank-frames

Lexicon of frame files used by Propbank annotation. A searchable, readable version of the latest release is here: http://propbank.github.io/v3.4.0/frames/
Creative Commons Attribution Share Alike 4.0 International

Difference with respect to AMR roles #1

Open danielhers opened 7 years ago

danielhers commented 7 years ago

I am aware of the fact that AMR has special rolesets ending in -91, but I noticed there is also at least one difference in "normal" rolesets. Specifically, ablate.01 has only roles 0, 1 and 2, but in AMR it also has role number 3. This role is used, for example, in AMR bel_pmid_1222_6756.17894 in the BioAMR data set:

(c2 / cause-01
      :ARG0 (s / suppress-01
            :ARG1 (t2 / transcribe-01
                  :ARG1 p))
      :ARG1 (a / ablate-01
            :ARG1 (e3 / express-03
                  :ARG2 (p / protein :name (n3 / name :op1 "RALT")))
            :ARG3 (m / molecular-physical-entity
                  :ARG0-of (i2 / inhibit-01
                        :ARG1 (e2 / enzyme :name (n2 / name :op1 "MEK-1")))))
      :ARG1-of (i3 / indicate-01
            :ARG0 (a2 / analyze-01
                  :instrument (t3 / thing :name (n4 / name :op1 "Northern" :op2 "blot"))
                  :ARG1-of (p2 / present-01
                        :location (f / figure :mod 3)))))

Are the PropBank frame files out of date, or does AMR use a different set of frames, as opposed to what is said in the README?

timjogorman commented 7 years ago

Hi Daniel,

| Are the PropBank frame files out of date, or does AMR use a different set of frames, as opposed to what is said in the README?

AMR uses what amounts to a "fork" of our frames. Since AMR and Propbank are both annotating new data in parallel and keep adding things, they need to be actively kept in sync. We've been working to make sure that any revisions that happen on the AMR side get into our frames, so that it's close to being a subset of ours, but some things can slip through (and I hadn't yet run any automatic checks on the new BioAMR release).

| Specifically, ablate.01 has only roles 0, 1 and 2, but in AMR it also has role number 3.

Thanks for noting that! That was a revision to the AMR frames that slipped through our update process; I'm going to run checks on BioAMR and make sure nothing else slipped through, and then make sure these get into the public frames.
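For readers who want to run a similar check themselves, here is a minimal sketch of one way to do it. This is not the script used by the PropBank team; the function names (`core_roles_used`, `roles_missing_from_frames`) and the `frame_roles` dictionary layout are assumptions made for illustration. It walks an AMR in PENMAN notation, collects the numbered `:ARGn` roles each roleset concept uses (crediting an inverse `:ARGn-of` to the child concept), and reports roles that a given frame inventory does not list.

```python
import re

# Tokens: quoted strings, the structural characters ( ) /, or any other run.
TOKEN = re.compile(r'"[^"]*"|[()/]|[^\s()/]+')
CORE_ROLE = re.compile(r':ARG(\d+)(-of)?$')
ROLESET_CONCEPT = re.compile(r'.+-\d+$')  # e.g. ablate-01


def core_roles_used(amr):
    """Map each roleset concept (e.g. 'ablate-01') to the ARG numbers it uses."""
    tokens = TOKEN.findall(amr)
    concept_of = {}   # variable -> concept
    stack = []        # variables of the currently open nodes
    uses = []         # (variable, ARG number) pairs, resolved at the end
    pending = None    # ARG number of an :ARGn-of awaiting its target
    i = 0
    while i < len(tokens):
        tok = tokens[i]
        if tok == "(":
            # A node opens as: ( variable / concept
            var, concept = tokens[i + 1], tokens[i + 3]
            concept_of[var] = concept
            if pending is not None:
                uses.append((var, pending))  # inverse role belongs to the child
                pending = None
            stack.append(var)
            i += 4
            continue
        if tok == ")":
            stack.pop()
        elif tok.startswith(":"):
            pending = None
            match = CORE_ROLE.match(tok)
            if match:
                number = int(match.group(1))
                if match.group(2):
                    pending = number
                else:
                    uses.append((stack[-1], number))
        elif pending is not None:
            uses.append((tok, pending))  # inverse role on a re-entrant variable
            pending = None
        i += 1

    result = {}
    for var, number in uses:
        concept = concept_of.get(var)
        if concept and ROLESET_CONCEPT.match(concept):
            result.setdefault(concept, set()).add(number)
    return result


def roles_missing_from_frames(amr, frame_roles):
    """Return {roleset: ARG numbers used in the AMR but absent from frame_roles}."""
    missing = {}
    for concept, used in core_roles_used(amr).items():
        lemma, _, sense = concept.rpartition("-")
        roleset = f"{lemma}.{sense}"  # AMR 'ablate-01' -> frame id 'ablate.01'
        if roleset in frame_roles:
            extra = used - frame_roles[roleset]
            if extra:
                missing[roleset] = extra
    return missing


# Role inventory as reported in this issue: ablate.01 has roles 0, 1 and 2.
frame_roles = {"ablate.01": {0, 1, 2}}
snippet = "(a / ablate-01 :ARG1 (e / express-03) :ARG3 (m / molecular-physical-entity))"
print(roles_missing_from_frames(snippet, frame_roles))  # {'ablate.01': {3}}
```

In practice `frame_roles` would be built from the frame files themselves, and rolesets absent from the inventory would probably be worth reporting too; both are left out to keep the sketch short.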

(We hope to eventually have a shared set of frames used by both Propbank and AMR, with usage flags for AMR-only and Propbank-only rolesets and roles.)
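To make the idea of usage flags concrete, here is a purely hypothetical sketch of how a shared roleset could mark which project uses each role. The class names, the `usage` field, and the project labels are assumptions for illustration and do not describe the actual frame file format. The example encodes only what this thread reports for ablate.01: roles 0, 1 and 2 in both resources, role 3 in AMR only.

```python
from dataclasses import dataclass, field

BOTH = frozenset({"PropBank", "AMR"})


@dataclass(frozen=True)
class Role:
    number: int
    usage: frozenset = BOTH  # hypothetical flag: projects that use this role


@dataclass
class Roleset:
    roleset_id: str
    roles: list = field(default_factory=list)

    def roles_for(self, project):
        """Return the role numbers flagged as in use for the given project."""
        return [role.number for role in self.roles if project in role.usage]


# ablate.01 as described in this issue; role 3 is flagged AMR-only.
ablate = Roleset(
    "ablate.01",
    [Role(0), Role(1), Role(2), Role(3, usage=frozenset({"AMR"}))],
)

print(ablate.roles_for("PropBank"))  # [0, 1, 2]
print(ablate.roles_for("AMR"))       # [0, 1, 2, 3]
```

With flags like these, a single inventory could serve both projects, and a sync check would reduce to comparing the roles each project's view exposes.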

danielhers commented 7 years ago

Thanks a lot! That answers my question. Should I close the issue or keep it open until the data is synced?

danielhers commented 7 years ago

Another difference: play.11 has ARG3: data carrier (score, record, tape, CD, DVD) in AMR but not in PropBank.

danielhers commented 7 years ago

And raise.02 has ARG3: recipient of money (if different from arg0) in AMR but not in PropBank.