Open wwfcnu opened 1 year ago
Can CLAP be used to filter data and calculate the similarity score between audio and captions?