Closed harisont closed 3 years ago
Thanks! If gfud still compiles with these changes, they should be OK. The new RTree module needs to be imported in several places I guess.
Aarne
From: Arianna Masciolini notifications@github.com Sent: Tuesday, March 2, 2021 4:44:11 PM To: GrammaticalFramework/gf-ud Cc: Subscribed Subject: [GrammaticalFramework/gf-ud] various changes for concept alignment (#4)
Here is a summary of the changes:
You can view, comment on, or merge this pull request online at:
https://github.com/GrammaticalFramework/gf-ud/pull/4
Commit Summary
File Changes
Patch Links:
— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHubhttps://github.com/GrammaticalFramework/gf-ud/pull/4, or unsubscribehttps://github.com/notifications/unsubscribe-auth/AAWBQXLKAN7D4NDUIVUNMDLTBUBUXANCNFSM4YPHABZA.
Yes, it does compile and RTree is imported in the various other modules now. The only thing that you should really check before merging is
in UDConcepts, simplified prUDSentence so that sent_ids are just numbers starting from 1 (but it looks like there are good reasons to revert this?)
The numbering was different because of some requirements of the malt parser, but it was a bit peculiar, I wonder if it is still necessary to keep it that way...?
Does it produce different numberings in the resulting CoNLL for any input?
From: Arianna Masciolini notifications@github.com Sent: Tuesday, March 2, 2021 4:56:21 PM To: GrammaticalFramework/gf-ud Cc: Aarne Ranta; Comment Subject: Re: [GrammaticalFramework/gf-ud] various changes for concept alignment (#4)
Yes, it does compile and RTree is imported in the various other modules now. The only thing that you should really check before merging is
in UDConcepts, simplified prUDSentence so that sent_ids are just numbers starting from 1 (but it looks like there are good reasons to revert this?)
The numbering was different because of some requirements of the malt parser, but it was a bit peculiar, I wonder if it is still necessary to keep it that way...?
— You are receiving this because you commented. Reply to this email directly, view it on GitHubhttps://github.com/GrammaticalFramework/gf-ud/pull/4#issuecomment-789011375, or unsubscribehttps://github.com/notifications/unsubscribe-auth/AAWBQXMRQPF75TUPG4JXLGTTBUDCLANCNFSM4YPHABZA.
For every input, because the original numbering started from 1000001 and each id was preceded by "gfud", so gfud1000001,gfud1000002 and so on. The new one is instead 1,2...
But obviously I can switch back
I prefer the old numbering, because it is easier to sort by Unix tools. And also change globally, e.g. gfud1 -> gfud2 if we want to combine two treebanks.
Aarne.
From: Arianna Masciolini notifications@github.com Sent: Tuesday, March 2, 2021 5:15:15 PM To: GrammaticalFramework/gf-ud Cc: Aarne Ranta; Comment Subject: Re: [GrammaticalFramework/gf-ud] various changes for concept alignment (#4)
For every input, because the original numbering started from 1000001 and each id was preceded by "gfud", so gfud1000001,gfud1000002 and so on. The new one is instead 1,2...
But obviously I can switch back
— You are receiving this because you commented. Reply to this email directly, view it on GitHubhttps://github.com/GrammaticalFramework/gf-ud/pull/4#issuecomment-789024804, or unsubscribehttps://github.com/notifications/unsubscribe-auth/AAWBQXJOJUCE77D2V6ERWUDTBUFJHANCNFSM4YPHABZA.
Now it is again as it was initially, I'm not sure about the other change you were mentioning
Numbering the sentences was the only change I needed. Everything is merged now. Thanks for your contributions!
Aarne.
From: Arianna Masciolini notifications@github.com Sent: Tuesday, March 2, 2021 5:23:41 PM To: GrammaticalFramework/gf-ud Cc: Aarne Ranta; Comment Subject: Re: [GrammaticalFramework/gf-ud] various changes for concept alignment (#4)
Now it is again as it was initially, I'm not sure about the other change you were mentioning
— You are receiving this because you commented. Reply to this email directly, view it on GitHubhttps://github.com/GrammaticalFramework/gf-ud/pull/4#issuecomment-789030983, or unsubscribehttps://github.com/notifications/unsubscribe-auth/AAWBQXMRRPO6CS3KE4J23HTTBUGI3ANCNFSM4YPHABZA.
Here is a summary of the changes:
AlignTrees
(well, moved it to theconcept-alignment
repository)BuildGFGrammar
(to use it inconcept-alignment
'sGenerateGrammar
, these two should probably become a single program which could belong to either repository)RTree
functions not specific to GF or UD trees fromGFConcepts
to a new module calledRTree
RTree
/
(already opened a pull request specifically for that)UDConcepts
(e.g. for comparing trees taking only some features into account)UDConcepts
, simplifiedprUDSentence
so thatsent_ids
are just numbers starting from 1 (but it looks like there are good reasons to revert this?).gfo
and.pgf
files to gitignore