illinois-or-research-analytics / cm_pipeline

Pipeline that uses an improved version of CM for generating well-connected clusters
GNU General Public License v3.0
5 stars 4 forks source link

Loads column 2 as character in utf-8 #55

Closed MinhyukPark closed 5 months ago

MinhyukPark commented 5 months ago

This PR addresses two things.

  1. loads specifically the second column as a utf-8 encoded character
  2. outputting values at the end without double quotes surrounding the cluster names
MinhyukPark commented 5 months ago

addresses #54

MinhyukPark commented 5 months ago

TODO: enforce integer ids for column 1 as well. Don't close #54 yet.