DS4PS / cpp-529-spr-2020

Course shell for CPP 529 Data Practicum on Community Analytics for Spring 2020.
http://ds4ps.org/cpp-529-spr-2020
1 stars 1 forks source link

Final Project : Identifying communities #16

Open Niagara1000 opened 4 years ago

Niagara1000 commented 4 years ago

Hi Professor @Anthony-Howell-PhD ,

Under the Identifying Communities section of the Final Project Outline Storyboard, there is this code. did you want us to create 9 clusters or should we stick to fewer? Your comments are included in the snippet that I am pasting below:


..
..

#Run Cluster Analysis
mod2 <- Mclust(Census2010[keep.these1]) 
# Set groups to 5, but you can remove this to let r split data into own groupings

summary(mod2, parameters = TRUE)
Census2010$cluster <- mod2$classification
AntJam-Howell commented 4 years ago

@Niagara1000 in that code example there is no pre-defined specification for number of groups. The number of groups will be determined based on the clustering algorithm. If you want to force the data into a specific number of groups, lets say four, you need to add the G=4 to the Mclust function as we did in Lab 6.