mozhao0331 / Restaurant_Segmentation_Analysis

MIT License
2 stars 2 forks source link

2023-05-31 Official Partner Meeting Agenda #87

Closed erictsai1208 closed 1 year ago

erictsai1208 commented 1 year ago

Moderator: Eric Tsai Notetaker: Chen Lin

Weekly check-in:

Cluster result:

  1. 31.5% Local community? - lower population density, some houses, but we also see community centres, churches, schools, etc.
  2. 24.97% Residential area - seems like there are lots of standalone houses nearby
  3. 1.47% Work? - lots of tall buildings (offices) and stores packed in one region
  4. 25.82% Shopping centres - lots of parking lots, more stores packed in one region
  5. 16.23% Travel/major intersection - many highways, intersections and hotels nearby

Question:

For fuzzy c-means (FCM) clustering, the best metrics for grid search are:

These metrics specifically take into account the fuzzy membership assignments in FCM clustering, in addition to the usual notions of within-cluster vs between-cluster distances.

CChCheChen commented 1 year ago

Action Item:

  1. Sending the demo cluster store number and cluster label to Sitewise
  2. Sending the list of features used to build the current best model

Meeting notes:

  1. Try elbow method to see what are the optimal number of cluster (if this is applicable for FCMeans)
    Optimal K value using "knee point detection algorithm"
  2. Try population density (or similar) and daytime population density ratio for super urban area
    daytime pop vs. residence
  3. Consider $\color{red} {\text{within-cluster}}$ over between-cluster distances
  4. Use test set to verify the modelled cluster, see if the same pattern of clusters still present
  5. Approved to use unsupervised to supervised validation approach
  6. No pressure for last model using Subway CAN
erictsai1208 commented 1 year ago

Cluster result discussion: