Manning, Christopher, Prabhakar Raghavan and Hinrich Schütze. 2008. “Flat Clustering” and “Hierarchical Clustering.” Chapters 16 and 17 from Introduction to Information Retrieval.
While a lot of the technicalities and math escaped me, I got the sense that coding for classification of text is in itself difficult enough. As some of the other posts mentioned, there is a lot of room for error, especially when it depends on the initial human input. Despite these existing difficulties, can this method be applied to different forms of data, for instance, both text and images? If so, how would it be a similar process, or would it require a different approach?
While a lot of the technicalities and math escaped me, I got the sense that coding for classification of text is in itself difficult enough. As some of the other posts mentioned, there is a lot of room for error, especially when it depends on the initial human input. Despite these existing difficulties, can this method be applied to different forms of data, for instance, both text and images? If so, how would it be a similar process, or would it require a different approach?