Benchmark PyTorch DataLoader

atolopko-czi commented 1 year ago

Run benchmarks to establish the baseline performance of the PyTorch DataLoader. The goal is to confirm that it is not significantly slower than equivalent data loading through ExperimentAxisQuery. This effectively tests that the iteration and Tensor transformation logic does not add significant overhead.
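As a rough illustration of the comparison described above, here is a minimal sketch that measures rows per second for two batch sources and reports the relative slowdown. The names `rows_per_second`, `slowdown`, `raw_chunks`, and `rebatched` are hypothetical, and the two generators are pure-Python stand-ins for an ExperimentAxisQuery read and the DataLoader. This is not the actual profiling code.

```python
import time
from typing import Callable, Iterable, Sequence


def rows_per_second(make_batches: Callable[[], Iterable[Sequence]]) -> float:
    """Iterate every batch once and return the number of rows consumed per second."""
    start = time.perf_counter()
    n_rows = 0
    for batch in make_batches():
        n_rows += len(batch)
    elapsed = time.perf_counter() - start
    return n_rows / elapsed if elapsed > 0 else float("inf")


def slowdown(baseline_rps: float, candidate_rps: float) -> float:
    """How many times slower the candidate is than the baseline (1.0 means equal)."""
    return baseline_rps / candidate_rps


# Stand-in for the "raw" read path: yields large chunks of rows.
def raw_chunks(n_rows: int = 10_000, chunk: int = 1_000):
    for start in range(0, n_rows, chunk):
        yield list(range(start, min(start + chunk, n_rows)))


# Stand-in for the loader path: re-slices each chunk into smaller batches.
def rebatched(batch_size: int = 128):
    for chunk in raw_chunks():
        for i in range(0, len(chunk), batch_size):
            yield chunk[i:i + batch_size]


baseline = rows_per_second(raw_chunks)
candidate = rows_per_second(rebatched)
print(f"slowdown vs baseline: {slowdown(baseline, candidate):.2f}x")
```

In a real run the two callables would wrap the actual query iteration and the actual DataLoader, and each measurement would be repeated several times to smooth out noise.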

Parameters to vary:

- obs, ~var~ sizes
- Dense vs sparse tensors
- Batch size
- SOMA buffer bytes config option
- Single worker vs multiple workers
- Eager vs lazy fetch
- ~Tensor precision: torch.set_float32_matmul_precision('medium' | 'high') (see the PyTorch docs)~
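One way to cover the dimensions listed above is to enumerate every combination up front and run the same timing routine for each. This is a sketch only: the keys and values in `GRID` are illustrative labels and sizes chosen for the example, not the DataLoader's actual parameter names or the settings used in the real benchmarks.

```python
from itertools import product

# Hypothetical benchmark grid; keys and values are assumptions for illustration.
GRID = {
    "n_obs": [10_000, 100_000],
    "sparse": [False, True],
    "batch_size": [128, 1024],
    "soma_buffer_bytes": [2**28, 2**30],
    "num_workers": [0, 4],
    "eager_fetch": [False, True],
}


def configurations(grid):
    """Yield one dict per combination of parameter values."""
    keys = list(grid)
    for values in product(*(grid[k] for k in keys)):
        yield dict(zip(keys, values))


configs = list(configurations(GRID))
print(f"{len(configs)} configurations to benchmark")
```

A full cross product grows quickly (two values per dimension already gives 64 runs), so varying one dimension at a time around a fixed default may be a more practical first pass.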

pablo-gar commented 1 year ago

@atolopko-czi I'm marking this as a P0 since it is relevant to our SCVI work.

atolopko-czi commented 1 year ago

Note (as discussed), SCVI-specific benchmarking is tracked by https://github.com/chanzuckerberg/cellxgene-census/issues/530. This is just "raw" Census PyTorch DataLoader benchmarking, w/o any model training component.

atolopko-czi commented 1 year ago

Profiling code and results will be stored in the repo: https://github.com/atolopko-czi/census-scvi-bench

atolopko-czi commented 1 year ago

Initial benchmarks have been run; see the analysis.

chanzuckerberg / cellxgene-census

Benchmark PyTorch DataLoader #514