Open jkanche opened 3 months ago
I ran into Tiledb's ML repo (https://github.com/TileDB-Inc/TileDB-ML/blob/master/tiledb/ml/readers/pytorch.py), which seems to implement a pytorch based dataloader. It looks like we can speed up our dataloaders without setting threads to 1.
Mostly exploration and then figuring out if we can adopt the same logic to our implementations.
Setting context to "spawn" will remove the need to setting threads to 1.
I ran into Tiledb's ML repo (https://github.com/TileDB-Inc/TileDB-ML/blob/master/tiledb/ml/readers/pytorch.py), which seems to implement a pytorch based dataloader. It looks like we can speed up our dataloaders without setting threads to 1.
Mostly exploration and then figuring out if we can adopt the same logic to our implementations.