chanzuckerberg / cellxgene-census

CZ CELLxGENE Discover Census
https://chanzuckerberg.github.io/cellxgene-census/
MIT License
84 stars 20 forks source link

Benchmark PyTorch DataLoader #514

Closed atolopko-czi closed 1 year ago

atolopko-czi commented 1 year ago

Run benchmarks to establish the baseline performance of the PyTorch DataLoader. The purpose is to ensure that performance is not significantly slower than an equivalent ExperimentAxisQuery data loading. This effectively tests that the iteration and Tensor transformation logic is not incurring significant performance degradation.

pablo-gar commented 1 year ago

@atolopko-czi I'm marking this as a P0 since it is relevant to our work related to SCVI

atolopko-czi commented 1 year ago

Note (as discussed), SCVI-specific benchmarking is tracked by https://github.com/chanzuckerberg/cellxgene-census/issues/530. This is just "raw" Census PyTorch DataLoader benchmarking, w/o any model training component.

atolopko-czi commented 1 year ago

Profiling code and results will be stored in the repo: https://github.com/atolopko-czi/census-scvi-bench

atolopko-czi commented 1 year ago

Initial benchmarking results have been run: analysis