Closed yujinqiu closed 7 months ago
I've attempted to perform the normalization in a manner similar to the PyTorch version and obtained the same results, so I believe the issue does not lie with the normalization process.
I guess it's because the dim(512) is high, even if there's a perfect match within a low-dimensional manifold, the overall average similarity could still appear low due to the effect of the remaining dimensions.
The sim score between image and image is normal, so I think the metric is correct.
Hi there, I'm try to print the top N similarity with the following output, sim is less than 0.5, but the result for human is right. I'm not sure weather it's a bug or not.