jwzimmer-zz / tv-tropes

UVM Stat 287 Final Project repo - network of tropes from TV Tropes wiki
MIT License
3 stars 3 forks source link

Run community detection now that we have tractably-sized networks #23

Open jwzimmer-zz opened 3 years ago

jwzimmer-zz commented 3 years ago

Use methods from https://github.com/jwzimmer/tv-tropes/issues/21

compare the categories we get to:

jwzimmer-zz commented 3 years ago

Index network clusters

Girvan-Newman 1) Girvan-Newman 6 clusters (b/c of the number of neighborhoods found by Stanford): https://github.com/jwzimmer/tv-tropes/commit/96e70cc68aed8db7cef072b5060aa98959c3889d on this network: https://github.com/jwzimmer/tv-tropes/issues/22#issuecomment-727626405.

4) Girvan-Newman 12 clusters using graph in BigFour_tropes_all4_top10000_top50_top20.gml: https://github.com/jwzimmer/tv-tropes/commit/2e378170891c2a19c5c3726527fe1252f7f69ac6

jwzimmer-zz commented 3 years ago

for now i'm only doing girvan-newman clustering for the indices: we're not sure if seeing how they cluster will be interesting compared to how the tropes cluster; if it looks like it is we can come back and do different kinds of clustering with the indices as well

nguyenhphilip commented 3 years ago

woo! finally got something to work. here's a group of 5 clusters with 100 nodes and 313 edges. each node was limited to 5 edges in adding them to the graph. more could have been added to a particular trope as edges were created for the tropes that were later added in the for-loop. but the overall size looks manageable!

https://github.com/jwzimmer/tv-tropes/commit/0f400773a32eb0aee3e57710ef130d3a5df7f84b