MarioniLab / oor_design_reproducibility

14 stars 1 forks source link

Disease-state benchmarking API #15

Closed emdann closed 1 year ago

emdann commented 1 year ago

Following structure of OpenProblems tasks

Datasets

Datasets consist of one anndata object with

In first instance this should include utils for data splitting and checking for important columns. Then real disease datasets can be served via the API.

Methods

Methods are wrappers around a workflow (e.g. scVI + scArches + milo) with a specific selection of reference and query dataset. Naming convention: integrationMethodID_diffMethodID_integrationReference_diffReference

Methods return:

Metrics

API

Each method produces a statistic over groups of cells (e.g. clusters/nhoods/metacells)