libffcv / ffcv

FFCV: Fast Forward Computer Vision (and other ML workloads!)
https://ffcv.io
Apache License 2.0
2.84k stars 178 forks source link

Merging two FFCV .beton datasets #266

Open manideep2510 opened 1 year ago

manideep2510 commented 1 year ago

Hi

Let's say we have two datasets written to two FCCV dataset files say dataset1.beton and dataset2.beton. Assume both these datasets have the exact same image and label format and are written using the same method. Is it possible to take these two .beton dataset files and merge them into a single merged-dataset.beton FFCV dataset file?

This will be very useful when we want to continuously increase the size of the training datasets through continuous data collection.

Thanks!

IlyaMescheryakov1402 commented 1 year ago

+1

It would be very helpful in production systems

kschuerholt commented 1 year ago
AlexSunNik commented 1 year ago

+1 Any update on this issue? This feature will be significantly useful for our usecase as well.