atosystem / SpeechCLIP

SpeechCLIP: Integrating Speech with Pre-Trained Vision and Language Model, Accepted to IEEE SLT 2022
https://atosystem.github.io/blogs/speechclip
BSD 3-Clause "New" or "Revised" License
108 stars 6 forks source link

Dataset source? #1

Closed FlyToYourMooN closed 2 years ago

FlyToYourMooN commented 2 years ago

Outstanding job! I just can't seem to find the link to the dataset in the cited paper

atosystem commented 2 years ago

@Biqigubafan Sorry, I forget to include the data preparing script. I will update the repo these days.

atosystem commented 2 years ago

@FlyToYourMooN I have updated the repo with data preparation instructions. I will close this issue now. Feel free to open it again if there is still something wrong. Thanks~