-
I don't think I've seen references to the _Scatter Separability Criterion_ (SSC) metric anywhere yet - neither in the Issues section here, nor on the mailing list.
Has it been ever considered and i…
-
There are many clustering algorithms in sklearn, we need to investigate which of these we can use for our application. Then we can also make a comparison of the quality of the flat clusterings produce…
-
Hi Alexander,
first of all: this is really a great R package. Thanks a lot for creating it!
When I tested the "fastnreliable" standard errors, bootstrap_type ‘31’ was computed super fast, while …
-
Algorithms/tools:
- [x] ~~antiSMASH~~
- [x] ~~[anvio](https://github.com/merenlab/anvio/blob/master/anvio/panops.py#L960)~~ (requires pangenome generation, leaving it out for now in favor of MCL, th…
-
Hello,
I just discovered that the index set used in cluster_by_isa uses byte long integers. (-127 to 128), which caused it to fail in my application with 250 microstates in the transition matrix.
``…
-
**What is the bug?**
> What we have is a race condition:
> 1. When the first test starts it creates a (local) ML Config index (which isn't needed)
> 2. When the first test finishes it issues a re…
-
It seems that the links from Google Scholar to some papers on proceedings.mlr.press are incorrect. For example, the link corresponding to the paper "Efficient Data Shapley for Weighted Nearest Neighbo…
-
I am a bioinformatics PhD and I really appreciate your mlr3cluster package. This package provides many unsupervised clustering algorithms. However, I regret to find that the two most commonly used alg…
-
**Aim**: Testing how the performance of different clustering algorithms for different datasets change on adding noise with different dimensions:
**To be done**: A jupyter notebook documentation …
-
When dereplicating sequences, it is crucial to cluster genomes that significantly overlap each other (i.e., high bidirectional coverage) to avoid clustering sequences where one is entirely contained w…