-
### Feature Description
Implement a clustering-based retrieval method for RAG pipelines using algorithms like DBSCAN. This feature would cluster document embeddings during indexing and retrieve docum…
-
*This can wait after the release.*
A discussion happened in the GLM PR https://github.com/scikit-learn/scikit-learn/pull/14300 about what properties we would like `sample_weight` to have.
First…
-
It would be nice to have table with TPOT vs sklearn operators.
AFAIK not all operators from sklearn are included in tpot. It could be used as:
- roadmap for tpot
- some dim reduction techniques (e.g.…
-
I was recently troubleshooting some strange race conditions I was seeing in gtests and `cuda-memcheck --tool initcheck` raised the following trace:
```bash
========= Host API memory access error a…
-
```
It would be useful to get Instances of Clustering given the class name (String)
and Map of Options or String of Options to get an instance of Clusterer from
JSAT.
```
Original issue reported …
-
I'm opening this issue to track demo preparation for KubeCon Shanghai which happens on June 24, 2019. See details in the following Google Doc:
https://docs.google.com/document/d/16ZtxByFkbfQyvvF8b…
-
Make the code more generic such that 3d, 4d points are supported
-
Hi, I am running through the [Seurat tutorial Pipeline](https://satijalab.org/seurat/articles/visiumhd_analysis_vignette) and having trouble with just the first part of the Banksy code and getting the…
-
```
It would be useful to get Instances of Clustering given the class name (String)
and Map of Options or String of Options to get an instance of Clusterer from
JSAT.
```
Original issue reported …
-
It would be useful, especially to potential contributors, to have a unified description of how public interfaces should be structured. At first glance, I assumed we would be attempting to stay as clo…