I keep running into problems caused by confusion over how df_centroids.point_id and df_centroids.cluster_labels are defined and used, and over how non-numeric point_id values are handled.
I think this needs to be clarified:
df_centroids is the central table that keeps track of the points at which downscaling occurs. Each point has a name, lat, lon and many other attributes.
point_id: originally loosely defined as both the index of each point in the table and a stand-in for the point's name. It will be split in two:
point_name, a string,
point_ind, an integer corresponding to the table index df_centroids.index.
cluster_labels: the output of the k-means algorithm. This is an integer.
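
To make the proposed split concrete, here is a minimal sketch of what df_centroids could look like afterwards, assuming it is a pandas DataFrame with a default integer index. The point names, coordinates, cluster label values and the lookup_by_name helper are all hypothetical and only there for illustration; they are not taken from the existing code.

```python
import pandas as pd

# Hypothetical points: names and coordinates are illustrative only.
df_centroids = pd.DataFrame(
    {
        "point_name": ["station_A", "station_B", "ridge_top"],  # free-form strings
        "lat": [46.0, 46.1, 46.2],
        "lon": [7.0, 7.1, 7.2],
    }
)

# point_ind mirrors the table index (df_centroids.index) as an integer.
df_centroids["point_ind"] = df_centroids.index.astype(int)

# cluster_labels would normally be produced by the k-means step;
# placeholder integers are used here instead of running the clustering.
df_centroids["cluster_labels"] = pd.Series(
    [0, 1, 0], index=df_centroids.index, dtype="int64"
)


def lookup_by_name(df, name):
    """Hypothetical helper: return the integer point_ind for a point_name."""
    matches = df.loc[df["point_name"] == name, "point_ind"]
    if len(matches) != 1:
        raise KeyError(
            f"expected exactly one point named {name!r}, found {len(matches)}"
        )
    return int(matches.iloc[0])
```

With this layout, anything that needs positional or numeric access uses point_ind, anything user-facing uses point_name, and cluster_labels stays a separate integer column that is never confused with either.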