RedHatInsights / aiops-insights-clustering

Clustering of systems
GNU General Public License v3.0
1 stars 14 forks source link

Clustering code cleanup #9

Closed tumido closed 5 years ago

tumido commented 5 years ago

I'm trying to make the clustering code readable. Let's simplify and optimize it.

Related: ~https://github.com/numpy/numpy/issues/11999~ (Not any more relevant)

tumido commented 5 years ago

@durandom would you be interested in this type of PRs? I have also some questions about the current codebase - I'm placing them inline as this PR review.

tumido commented 5 years ago

cc @bronaghs

tumido commented 5 years ago

Also the numpy issue shouldn't matter anymore, since the missing data handling can be done in Pandas directly. I'll fix that in later iteration of this PR later today...

durandom commented 5 years ago

@MichaelClifford could you do a review too?

tumido commented 5 years ago

@durandom, I've rewritten the rest:

@Ladas, here you go. This is the "Pandas only" approach I was talking about