GoogleCloudPlatform / dataflux-pytorch

The Dataflux Accelerated Dataloader for PyTorch with GCS is an effort to improve ML-training efficiency when using data stored in GCS for training datasets. Using the Dataflux Accelerated Dataloader for training is up to 3X faster when the dataset consists of many small files (e.g., 100 - 500 KB).
Apache License 2.0
26 stars, 4 forks

Fixing build failures #125

Closed: Yash9060 closed this 1 week ago

Yash9060 commented 1 week ago

The continuous benchmark test failed. It seems the PR that added single-node benchmarking to the continuous runs was raised after I raised PR #121.

Replaced all instances of lightning_checkpoint_benchmark.py with checkpointing/singlenode/train.py.
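For anyone making a similar rename, a bulk replacement like the one described above could be scripted roughly as follows. This is only a sketch, not the change actually made in this PR: the helper name `replace_references` and the list of file suffixes are assumptions chosen for illustration. It does a plain substring replacement, so any directory prefix in front of the old filename is left in place and should be reviewed by hand.

```python
from pathlib import Path

# The two strings come from the PR description above.
OLD = "lightning_checkpoint_benchmark.py"
NEW = "checkpointing/singlenode/train.py"


def replace_references(root, old=OLD, new=NEW,
                       suffixes=(".py", ".yaml", ".yml", ".sh", ".md")):
    """Rewrite every occurrence of `old` to `new` in text files under `root`.

    Hypothetical helper: the suffix list is an assumption, adjust it to the
    file types that actually reference the script. Returns the changed paths.
    """
    changed = []
    for path in sorted(Path(root).rglob("*")):
        # Only touch regular files with one of the selected suffixes.
        if not path.is_file() or path.suffix not in suffixes:
            continue
        try:
            text = path.read_text(encoding="utf-8")
        except UnicodeDecodeError:
            # Skip files that are not valid UTF-8 text.
            continue
        if old in text:
            path.write_text(text.replace(old, new), encoding="utf-8")
            changed.append(path)
    return changed
```

Calling `replace_references(".")` from the repository root returns the list of files it rewrote, which can then be checked with `git diff` before committing.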