Closed chozelinek closed 5 years ago
The noun chunks depend on the part-of-speech tags and dependency parse, so this issue likely comes down to incorrect predictions made by the tagger or parser.
I'm merging this with #3052. We've now added a master thread for incorrect predictions and related reports – see the issue for more details.
The problem
I've noticed that the noun chunk iterator sometimes yields a noun chunk that is embedded in a longer one. So far I have only seen this behaviour in a few examples involving relative clauses with "which".
Take the sentence "Including equity share of refineries in which the Group has a stake."
Both "the Group" and "in which the Group has a stake" are marked as noun chunks, so the first chunk is nested inside the second. Noun chunks do not normally overlap like this. I've included a few examples so you can reproduce and study the behaviour.
How to reproduce the behaviour
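A minimal sketch of how the nesting could be checked programmatically. The helper names (`find_nested_spans`, `noun_chunk_spans`, `report_nested_chunks`) are hypothetical, and the model name `en_core_web_sm` is an assumption, since the report does not state which model was used. Results may differ across models and spaCy versions.

```python
def find_nested_spans(spans):
    """Return (inner, outer) pairs where inner lies fully inside another span.

    Each span is a (start, end, text) tuple of token offsets, end exclusive.
    Spans with identical boundaries are not counted as nested.
    """
    nested = []
    for inner in spans:
        for outer in spans:
            if inner is outer:
                continue
            same_bounds = inner[0] == outer[0] and inner[1] == outer[1]
            if not same_bounds and outer[0] <= inner[0] and inner[1] <= outer[1]:
                nested.append((inner, outer))
    return nested


def noun_chunk_spans(text, model="en_core_web_sm"):
    """Run spaCy on text and return its noun chunks as (start, end, text) tuples.

    The default model name is an assumption; substitute the one you use.
    """
    import spacy  # imported lazily so the helper above works without spaCy

    nlp = spacy.load(model)
    doc = nlp(text)
    return [(chunk.start, chunk.end, chunk.text) for chunk in doc.noun_chunks]


def report_nested_chunks(text, model="en_core_web_sm"):
    """Print every noun chunk that is embedded in a longer noun chunk."""
    for inner, outer in find_nested_spans(noun_chunk_spans(text, model)):
        print(f"{inner[2]!r} is embedded in {outer[2]!r}")
```

Calling `report_nested_chunks("Including equity share of refineries in which the Group has a stake.")` would list any embedded chunks the loaded model produces for that sentence. If the parser prediction is correct, nothing is printed.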
These are my comments on the texts analyzed:
Your Environment