openeventdata / petrarch2

Another next-generation event coding platform.
MIT License
71 stars 42 forks source link

Adapting new Treebank format #50

Open InzamamAnwar opened 8 months ago

InzamamAnwar commented 8 months ago

Hi,

I was trying to use PETRARCH2 with newer version of CoreNLP specially Stanza. It also includes new pipeline which is robust and based on Neural Networks.

The parse tree is somewhat different from the one available from CoreNLP Docker available within this repo. An example is given below. Depending on small differences, I cannot get events from the new parsed tree format.

Original Text

Winterfell has asked the Lannister families to clarify those issues but Beijing has not yet straightened them out, he said, adding that Japan would continue to talk to China about this.

Parsed Output from included CoreNLP Docker

(S (S (NP (NNP WINTERFELL )  )  (VP (VP (VBZ HAS )  (VP (VBN ASKED )  (NP (DT THE )  (NNP LANNISTER )  (NNS FAMILIES )  )  (S (VP (TO TO )  (VP (VB CLARIFY )  (NP (DT THOSE )  (NNS ISSUES )  )  )  )  )  )  )  (CC BUT )  (S (NP (NNP BEIJING )  )  (VP (VBZ HAS )  (RB NOT )  (ADVP (RB YET )  )  (VP (VBD STRAIGHTENED )  (NP (PRP THEM )  )  (PRT (RP OUT )  )  )  )  )  )  )  (PRN (, , )  (S (NP (PRP HE )  )  (VP (VBD SAID )  )  )  (, , )  )  (S (VP (VBG ADDING )  (SBAR (IN THAT )  (S (NP (NNP JAPAN )  )  (VP (MD WOULD )  (VP (VB CONTINUE )  (S (VP (TO TO )  (VP (VB TALK )  (PP (TO TO )  (NP (NNP CHINA )  )  )  (PP (IN ABOUT )  (NP (DT THIS )  )  )  )  )  )  )  )  )  )  )  )  (. . )  )  

Paresed Output from Stanza CoreNLP CLient

(S (S (S (NP (NNP WINTERFELL )  )  (VP (VBZ HAS )  (VP (VBN ASKED )  (NP (DT THE )  (NNP LANNISTER )  (NNS FAMILIES )  )  (S (VP (TO TO )  (VP (VB CLARIFY )  (NP (DT THOSE )  (NNS ISSUES )  )  )  )  )  )  )  )  (CC BUT )  (S (NP (NNP BEIJING )  )  (VP (VBZ HAS )  (RB NOT )  (ADVP (RB YET )  )  (VP (VBN STRAIGHTENED )  (NP (PRP THEM )  )  (PRT (RP OUT )  )  )  )  )  )  (, , )  (NP (PRP HE )  )  (VP (VBD SAID )  (, , )  (S (VP (VBG ADDING )  (SBAR (IN THAT )  (S (NP (NNP JAPAN )  )  (VP (MD WOULD )  (VP (VB CONTINUE )  (S (VP (TO TO )  (VP (VB TALK )  (PP (IN TO )  (NP (NNP CHINA )  )  )  (PP (IN ABOUT )  (NP (DT THIS )  )  )  )  )  )  )  )  )  )  )  )  )  (. . )  )  

Could you please highlight how can I use new Treebank format with PETRARCH2.