molbap closed 4 months ago
HF datasets support is all in chug now. A big open question is whether we add support for other forms such as CSV / file-folder directly, or focus on webdataset, HF datasets, and possibly other sharded formats that we create to address specific needs / performance...
Currently in pixparse other dataloaders are defined by e.g.
Instead of keeping this util in pixparse, we can write it here so that batch creation is handled at a lower level, and then use chug normally from the pixparse lib.
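To make the "batch creation at a lower level" idea concrete, here is a rough sketch of what such a helper could look like. It is plain Python and does not use any chug or pixparse API; the names `collate_samples` and `iter_batches`, and the assumption that each sample is a dict, are hypothetical and only there for illustration.

```python
from typing import Any, Dict, Iterable, Iterator, List, Sequence


def collate_samples(samples: Sequence[Dict[str, Any]]) -> Dict[str, List[Any]]:
    """Merge per-sample dicts into one dict of per-key lists (hypothetical helper)."""
    if not samples:
        return {}
    keys = samples[0].keys()
    batch: Dict[str, List[Any]] = {k: [] for k in keys}
    for sample in samples:
        # All samples in a batch are assumed to share the same keys.
        if sample.keys() != keys:
            raise KeyError(f"inconsistent sample keys: {sorted(sample)} vs {sorted(keys)}")
        for k in keys:
            batch[k].append(sample[k])
    return batch


def iter_batches(
    samples: Iterable[Dict[str, Any]],
    batch_size: int,
    drop_last: bool = False,
) -> Iterator[Dict[str, List[Any]]]:
    """Group a stream of samples into collated batches (hypothetical helper)."""
    if batch_size < 1:
        raise ValueError("batch_size must be >= 1")
    pending: List[Dict[str, Any]] = []
    for sample in samples:
        pending.append(sample)
        if len(pending) == batch_size:
            yield collate_samples(pending)
            pending = []
    # Emit the final partial batch unless the caller asked to drop it.
    if pending and not drop_last:
        yield collate_samples(pending)
```

With something like this living in the lower-level library, pixparse would only consume ready-made batches instead of defining its own collation. A real version would presumably stack tensors rather than build lists, but that depends on the sample format.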