tensorflow / datasets

TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
https://www.tensorflow.org/datasets
Apache License 2.0
4.31k stars 1.54k forks source link

[data request] Google conceptual captions #1036

Open SMercier88 opened 5 years ago

SMercier88 commented 5 years ago

Folks who would also like to see this dataset in tensorflow/datasets, please thumbs-up so the developers can know which requests to prioritize.

And if you'd like to contribute the dataset (thank you!), see our guide to adding a dataset.

dhirensr commented 5 years ago

@SMercier88 : the training dataset would contain image links and captions right? as per the website in the dataset file they provide links and image captions. I can work on this dataset.

SMercier88 commented 5 years ago

@dhirensr yes, that is exactly it. Thank you!

dhirensr commented 5 years ago

@Conchylicultor : if the data is in .tsv format and it doesn't need to be extracted then should i download it manually? because when I give the link of .tsv it's not downloading properly maybe because it's trying to extract but it should not. what would be the solution?

Conchylicultor commented 5 years ago

@dhirensr use dl_manager.download(url) instead of dl_manager.download_and_extract() ?

dhirensr commented 5 years ago

@Conchylicultor @SMercier88 : added this dataset! please review https://github.com/tensorflow/datasets/pull/1165