Closed by sgratiy 3 years ago
Prototype performance on a synthetic dataset of 20k cells by 30k genes:
in memory: built-in saving to backing file (terrible time and memory performance):
my implementation (fast and tight):
Testing on a small dataset (300k), fully in memory:
in backed mode, saving chunks to a different file (using just 10% of memory):
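For anyone who wants to reproduce this kind of time and memory comparison, here is a minimal sketch using only the Python standard library. It is not the implementation benchmarked above: plain binary files stand in for the dataset, and `measure`, `copy_in_memory`, and `copy_in_chunks` are hypothetical names made up for illustration. It only shows the general idea of comparing a load-everything approach against writing fixed-size chunks to a different file.

```python
import time
import tracemalloc


def measure(fn, *args):
    """Return (result, seconds, peak_bytes) for one call of fn(*args).

    Hypothetical helper: peak_bytes only counts allocations made through
    Python's allocators, as reported by tracemalloc.
    """
    tracemalloc.start()
    start = time.perf_counter()
    try:
        result = fn(*args)
        elapsed = time.perf_counter() - start
        _, peak = tracemalloc.get_traced_memory()
    finally:
        tracemalloc.stop()
    return result, elapsed, peak


def copy_in_memory(src, dst):
    """Stand-in for the fully in-memory path: read everything, then write."""
    with open(src, "rb") as fin:
        data = fin.read()
    with open(dst, "wb") as fout:
        fout.write(data)
    return len(data)


def copy_in_chunks(src, dst, chunk_size=64 * 1024):
    """Stand-in for the chunked path: stream chunks into a different file."""
    total = 0
    with open(src, "rb") as fin, open(dst, "wb") as fout:
        while True:
            chunk = fin.read(chunk_size)
            if not chunk:  # empty bytes means end of file
                break
            fout.write(chunk)
            total += len(chunk)
    return total
```

Calling `measure(copy_in_memory, src, dst)` and `measure(copy_in_chunks, src, other_dst)` on the same input gives two `(result, seconds, peak_bytes)` tuples that can be compared directly. The chunked variant should keep its peak close to `chunk_size` rather than the full file size, which is the kind of saving the backed-mode numbers above describe.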