google-research / scenic

Scenic: A Jax Library for Computer Vision Research and Beyond
Apache License 2.0

[vid2seq] Access to the ASR data #776

Closed by PKUCSS 1 year ago

PKUCSS commented 1 year ago

@antoyang @a-nagrani Dear authors, thanks for the great work. Could you please provide the transcribed ASR data for the YouCook2 and ActivityNet Captions datasets that you used in your experiments, so that others can reproduce your results on these downstream datasets?

I know that we can produce ASR data by using open-source ASR models, calling commercial APIs, or processing subtitle data from YouTube. However, I believe that access to the original ASR data used in the paper would improve the reproducibility of your work.

antoyang commented 1 year ago

Hi, unfortunately, releasing ASR data is not possible.

PKUCSS commented 1 year ago

Thanks anyway. I'm trying to reproduce the results with Hugging Face Transformers and PyTorch, so I may ask further questions if I run into issues. Thanks again for your patient response.