meghdadFar / wordview

A Python package for Exploratory Data Analysis (EDA) for text-based data.
MIT License
11 stars 1 forks source link

Bug Report: Some MWEs appear in multiple classes #143

Open meghdadFar opened 6 months ago

meghdadFar commented 6 months ago

Describe the bug

Some multiword expressions appear in multiple (close) classes. For instance, femme fatale appears both in Noun Noun and Adjective Noun compounds. This can be due to POS tagging inaccuracies and is also a known problem. But it could also be due to some other issues. It has to be investigated and fixed. When the issue is with POS tags, we have to decide in which category we place it and put it only there in that category, and avoid presenting one MWE in two categories.

To Reproduce

Follow docs to generate MWEs.

Expected behaviour

A clear and concise description of what you expected to happen.

Screenshots

Screenshot 2024-04-08 at 09 59 27