Open rom1504 opened 2 years ago
Build a LAION-like dataset with multiple captions per sample: dedup on url+caption instead of on url alone, then group by url at the end
This may also help to select the best caption for a given image
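A minimal sketch of the idea, assuming the input is an in-memory list of (url, caption) pairs. The function name `group_captions` and the example URLs are hypothetical, and a real LAION-scale pipeline would need a distributed dedup rather than a Python set:

```python
def group_captions(pairs):
    """Dedup on (url, caption), then group the surviving captions by url."""
    seen = set()   # (url, caption) pairs already kept
    grouped = {}   # url -> list of distinct captions, in first-seen order
    for url, caption in pairs:
        key = (url, caption)
        if key in seen:
            continue  # exact duplicate of a pair already kept
        seen.add(key)
        grouped.setdefault(url, []).append(caption)
    return grouped


# Hypothetical sample data: the same image appears with two different captions.
pairs = [
    ("http://example.com/a.jpg", "a cat"),
    ("http://example.com/a.jpg", "a cat"),
    ("http://example.com/a.jpg", "a tabby cat on a sofa"),
    ("http://example.com/b.jpg", "a dog"),
]
grouped = group_captions(pairs)
```

Deduplicating on url alone would keep only one caption per image. Keying on the pair keeps every distinct caption, so a later step could rank the captions of each url and pick the best one. Captions are compared as exact strings here; any normalization (case, whitespace) would be an additional choice.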