cellarium-ai / cellarium-ml

Distributed single-cell data analysis.
BSD 3-Clause "New" or "Revised" License
11 stars 2 forks source link

Cellarium ML: distributed single-cell data analysis.


Cellarium ML is a PyTorch Lightning-based library for distributed single-cell data analysis. It provides a set of tools for training deep learning models on large-scale single-cell datasets, including distributed data loading, model training, and evaluation. Cellarium ML is designed to be modular and extensible, allowing users to easily define custom models, data transformations, and training pipelines.

Code organization

The code is organized as follows:

Installation

To install from the pip::

$ pip install cellarium-ml

To install the developer version from the source::

$ git clone https://github.com/cellarium-ai/cellarium-ml.git $ cd cellarium-ml $ make install # runs pip install -e .[dev]

For developers

To run the tests::

$ make test # runs single-device tests $ TEST_DEVICES=2 make test # runs multi-device tests

To automatically format the code::

$ make format # runs ruff formatter and fixes linter errors

To run the linters::

$ make lint # runs ruff linter and checks for formatter errors

To build the documentation::

$ make docs # builds the documentation at docs/build/html