Clustering with collocations -- would it lead to different results?

Computational-Content-Analysis-2018 / 19-Jan-Flat-Clustering

Manning, Christopher, Prabhakar Raghavan and Hinrich Schütze. 2008. “Flat Clustering” and “Hierarchical Clustering.” Chapters 16 and 17 from Introduction to Information Retrieval.

https://github.com/Computational-Content-Analysis-2018

0 stars 1 forks source link

Clustering with collocations -- would it lead to different results? #6

Open sunnyjooey opened 6 years ago

sunnyjooey commented 6 years ago

I don't think the chapters go into detail on what the unit of analysis is, but I'm assuming it's a word. So, would clustering based on collocations (groups of words that have meaning together) lead to significantly different clusters? Is there a way to capture the relationships between words (instead of just presence or frequency) in a way that is informative to clustering?