hms-dbmi-cellenics / issues

This repository is used to report and track issues

1 stars 0 forks source link

Remove clustering code from the pipeline. Trigger Clustering in the worker, as we do UMAP #30

Open gerbeldo opened 10 months ago

gerbeldo commented 10 months ago

Background

Clustering code is both in the pipeline and the worker, completely duplicated (tests included).

Currently, the first time clustering is calculated it is done in the pipeline, and that is what is shown to users. If a user changes a clustering parameter, a work request is triggered. And from then on, clustering is always performed in the worker.

@kafkasl has implemented a mechanism to trigger a work request for the UMAP and marker heatmap automatically after the pipeline finishes. This could be extended to clustering, removing the need for duplicated code (and a lot simpler than previously proposed solutions, like an R package with code for clustering to be imported by both the pipeline and worker).

goal

Trigger a work request for clustering automatically at the end of the pipeline
remove clustering code and tests from the pipeline.