Closed suzhenghang closed 5 years ago
Haven’t verified yet - that’s next on my TODO list. Thanks for raising it.
I suspected that the embedding will not be 100% the same, so I’ll be writing test cases next week to see how different the embeddings are.
@suzhenghang I have just investigated this and the embeddings are much different. I don't know why this is, so I will be working on figuring this out over the next few weeks. Again, thanks for raising this.
Apologies for the amount of updates. I have found the problem and will be updating the code shortly to reflect this. I didn't account for the different dimensions of the data between tensorflow and pytorch when flattening the features output. As you can see, the comparison is identical: with a cosine distance of 0. There are some variations at the 5th decimal place, but the PCA postprocessor ignores these small variations and produces the correct embeddings. I'll close this issue now as this problem has been resolved. Thank you for reporting it.
Hi @harritaylor , I have inputed the piano.wav into tensorflow vggish, but the pca embedding is diff from pytorchvggish. Do you verify the output after the conversion?