Starting this issue to track minimal examples we can create to demonstrate effective usage and value of TorchData nodes. I can create separate issues for each of these as required.
Motivation, pitch
Vanilla torch.utils dataloader usage ported over to torchdata nodes
GPU accelerated transforms
Flexible parallelism (mixing multiprocessing with multithreading)
Examples porting over popular OSS datasets
connecting to popular cloud storage
Example creating new nodes (might get covered through examples above)
Basic multimodal model trained E2E using torchdata nodes
Chaining multiple transforms (might get covered through examples above)
Dataset mixing (with different sampling strategies)
🚀 The feature
Starting this issue to track minimal examples we can create to demonstrate effective usage and value of TorchData nodes. I can create separate issues for each of these as required.
Motivation, pitch
Alternatives
No response
Additional context
No response