own-pt / rte-sick

RTE Experiment

rewriting rule 3: "vending_machine" #58

Open vcvpaiva opened 7 years ago

vcvpaiva commented 7 years ago

As the careful analysis of 380 SICK sentences shows (#57), we need to reconstruct mwes before trying to map them to SUMO. Easy examples: "baby panda" is not a HumanBaby + Panda, and "cartoon airplane" is not an airplane.

@fcbr, @arademaker any suggestions here, once you are given the nn or mwe label?

vcvpaiva commented 7 years ago

Similarly we need new additions for "particle verbs" and prepositions (@arademaker any suggestions from Francis Bond?)

arademaker commented 7 years ago

@vcvpaiva rules for post-processing the nn -> nn dependencies could suggest mwe candidates, for example? But as @fcbr said, we need some KB knowledge too. This is a good topic to discuss, but not here.

arademaker commented 7 years ago

I didn't understand the second comment.

vcvpaiva commented 7 years ago

@arademaker the dependencies called noun-noun (nn) in SD will produce candidates for mwes. Some are crystallized and might be found in a resource such as PWN (or an extension of PWN that paid more attention to compounds like ARK), e.g. paddling pool, vending machine, soccer player.

others are wrong, like "man eating" in "There is no man eating", where "eating" is not a noun.

others are not crystallized (like forest path), and it is difficult to decide how to deal with them (use both concepts or just the head?). Finally, for others still, like "tiger cub", I cannot even tell whether they are crystallized or not.
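The three cases above (crystallized, wrong, not crystallized) can be sketched as a small classifier over nn dependencies. This is only an illustration: `NNDep`, `classify_nn_candidate` and `TOY_LEXICON` are hypothetical names, the lexicon is a toy stand-in for a real PWN lookup, and the POS check only helps when the tagger disagrees with the parser (if "eating" is also tagged as a noun, it will not be caught).

```python
from typing import NamedTuple

# Toy stand-in for PWN (assumption): a real setup would query WordNet itself.
TOY_LEXICON = {"vending_machine", "swimming_pool", "soccer_player"}


class NNDep(NamedTuple):
    """One nn(head, modifier) dependency plus the POS tags of both words."""
    modifier: str
    head: str
    modifier_pos: str
    head_pos: str


def classify_nn_candidate(dep, lexicon=TOY_LEXICON):
    """Label an nn dependency as 'wrong', 'crystallized' or 'open'."""
    # If either word is not tagged as a noun, treat the nn edge as a parse error.
    if not (dep.modifier_pos.startswith("NN") and dep.head_pos.startswith("NN")):
        return "wrong"
    # Build the underscore-joined lemma and look it up in the lexicon.
    lemma = f"{dep.modifier}_{dep.head}".lower()
    if lemma in lexicon:
        return "crystallized"
    # A noun-noun compound the lexicon does not know about (e.g. forest path).
    return "open"


if __name__ == "__main__":
    for dep in [
        NNDep("vending", "machine", "NN", "NN"),
        NNDep("forest", "path", "NN", "NN"),
        NNDep("man", "eating", "NN", "VBG"),
    ]:
        print(dep.modifier, dep.head, classify_nn_candidate(dep))
```

What to do with the "open" cases (use both concepts or just the head) is still the undecided part; the sketch only separates them out.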

vcvpaiva commented 7 years ago

about my second comment, it has two parts:

  1. you mentioned that Francis might be involved with an LKB (lexical knowledge base) of prepositions? if we don't go the whole hog looking for one, we should at least use the ones in the SD manual http://nlp.stanford.edu/software/dependencies_manual.pdf as a post-processing step.

  2. we need to decide how to deal with particle verbs. maybe more can be added to PWN, but if not, some particles need to not produce concepts. the example I gave: in "the toddler is standing up", "up" is just a particle that determines the sense of standing, but in "the index is up", "up" should map to "Increasing".
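A minimal sketch of the particle case, assuming each word comes with its SD relation label: `prt` is the SD label for a phrasal verb particle, while the `root` label used for "up" in "the index is up" is an assumption about what the parser returns. `concept_for` and `TOY_CONCEPTS` are hypothetical names, not part of the existing pipeline.

```python
# Toy word-to-SUMO mapping, for illustration only (assumption).
TOY_CONCEPTS = {"up": "Increasing"}


def concept_for(word, deprel, concepts=TOY_CONCEPTS):
    """Return a concept name for the word, or None if it should yield no concept."""
    # A verb particle only selects the sense of its verb, so it gets no concept.
    if deprel == "prt":
        return None
    # Otherwise fall back to the plain word-to-concept lookup.
    return concepts.get(word.lower())


if __name__ == "__main__":
    print(concept_for("up", "prt"))   # "the toddler is standing up"
    print(concept_for("up", "root"))  # "the index is up" (label assumed)
```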

issue #65, issue #36

vcvpaiva commented 7 years ago

As commented above, there are several kinds of multiword expressions that we need to handle. This issue has now been restricted to mwes that exist in PWN, like vending_machine and swimming_pool. Unfortunately, paddling_pool does not seem to exist in PWN.
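For the restricted case, a rough sketch of what such a rewriting step could look like on a plain token list. The names `merge_mwes` and `pwn_mwes` are hypothetical, the set is a toy stand-in for a PWN lookup, and only two-word expressions are handled; this is not the rule as implemented in the repo.

```python
def merge_mwes(tokens, lexicon):
    """Join adjacent token pairs into one underscore token when the lexicon has it."""
    merged = []
    i = 0
    while i < len(tokens):
        if i + 1 < len(tokens):
            candidate = f"{tokens[i]}_{tokens[i + 1]}".lower()
            if candidate in lexicon:
                # Known mwe: emit the joined form and skip both tokens.
                merged.append(candidate)
                i += 2
                continue
        # No match: keep the token unchanged.
        merged.append(tokens[i])
        i += 1
    return merged


if __name__ == "__main__":
    pwn_mwes = {"vending_machine", "swimming_pool"}  # toy stand-in for PWN
    print(merge_mwes(["a", "man", "uses", "a", "vending", "machine"], pwn_mwes))
    print(merge_mwes(["a", "kid", "is", "in", "a", "paddling", "pool"], pwn_mwes))
```

With this toy set, "paddling pool" stays as two tokens, which mirrors the gap noted above.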