Open vcvpaiva opened 7 years ago
Similarly we need new additions for "particle verbs" and prepositions (@arademaker any suggestions from Francis Bond?)
@vcvpaiva rules for post-processing the dependencies nn -> nn can suggest mwe candidates for example? But as @fcbr said, we need some KB knowledge too. this is a good topic to discuss not here.
I didn't understand the second comment.
@arademaker the dependencies called noun-noun in SD will produce candidates for mwes. some are crystalized and might be found in a resource such as PWN (or an extension of PWN that paid more attention to compounds like ARK). e.g. paddling pool, vending machine, soccer player.
others are wrong, like "man eating" in "There is no man eating", where "eating" is not a noun.
others are not crystallized (like forest path) and are difficult to decide how to deal with them (use both concepts or just the head?) . finally, others still, like "tiger cub" I cannot even tell if it's crystalized or not.
about my second comment it has two parts: 1.you mention that Francis might be involved with a LKB (lexical knowledge base) of prepositions? if we don't go the whole hog looking for such, we should at least use the ones in the SDs manual http://nlp.stanford.edu/software/dependencies_manual.pdf as a post-processing step.
issue #65, issue #36
As commented above there several kinds of multiword expressions that we need to have. this issue has now been restricted to mwes that exist in PWN, like vending_machine and swimming_pool. unfortunately paddling_pool does not seem to exist in PWN
As the careful analysis of 380 SICK sentences shows (#57) we need to reconstruct mwes before trying to map them to SUMO. Easy examples are "baby panda" ---> not a HumanBaby + Panda, "cartoon airplane" is not an airplane.
@fcbr, @arademaker any suggestions here, once you are given the nn or mwe label?