uptrain-ai / uptrain

UpTrain is an open-source unified platform to evaluate and improve Generative AI applications. We provide grades for 20+ preconfigured checks (covering language, code, embedding use-cases), perform root cause analysis on failure cases and give insights on how to resolve them.
https://uptrain.ai/
Apache License 2.0

Add HDBSCAN Clustering #226

Status: Closed (vipgupta closed this issue 1 year ago)

vipgupta commented 1 year ago

Currently, we have DBSCAN clustering implemented as shown below.

https://github.com/uptrain-ai/uptrain/blob/8d5b504561f6c7634553480d1e38ec9a9b5716ef/uptrain/core/classes/visuals/dimensionality_reduction.py#L292

We also want to add the HDBSCAN algorithm, with the choice of clustering algorithm configurable through the UpTrain config. Please also document how to select the clustering algorithm in UpTrain and when to use DBSCAN versus HDBSCAN.

Yashi11 commented 1 year ago

Hi! I would like to work on this. Please assign it to me.

vipgupta commented 1 year ago

Hi @Yashi11, Done!

vipgupta commented 1 year ago

Solved by #240