tensorflow / datasets

TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
https://www.tensorflow.org/datasets
Apache License 2.0
4.3k stars 1.54k forks source link

[data request] YouTube-8M Segments Dataset #5029

Open XinyangHan opened 1 year ago

XinyangHan commented 1 year ago

We collected human-verified labels on about 237K segments on 1000 classes from the validation set of the YouTube-8M dataset. Each video will again come with time-localized frame-level features so classifier predictions can be made at segment-level granularity. We encourage researchers to leverage the large amount of noisy video-level labels in the training set to train models for temporal localization.

Folks who would also like to see this dataset in tensorflow/datasets, please thumbs-up so the developers can know which requests to prioritize.

And if you'd like to contribute the dataset (thank you!), see our guide to adding a dataset.

fineguy commented 1 year ago

Hi @XinyangHan

Thanks for creating this request!

Our team has a limited bandwidth to implement all requested dataset. But do not hesitate to send a pull request if you already implemented this dataset.