Is it normal that FID is affected by the num_gen option?

atong01 / conditional-flow-matching

TorchCFM: a Conditional Flow Matching library

MIT License

Yes, this is normal. FID is defined as the squared 2-Wasserstein (Fréchet) distance between two Gaussian approximations. We first embed the generated and real data with the deepest Inception layer (to compare features rather than pixels), then compute the Gaussian statistics (mean and covariance) of the generated and real features. The more samples we use to estimate these statistics, the better the approximation, so the FID changes with num_gen, and scores computed with different sample counts are not directly comparable.
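To make the sample-size effect concrete, here is a minimal NumPy sketch of the distance itself: ||mu_a - mu_b||^2 + Tr(C_a) + Tr(C_b) - 2 Tr((C_a C_b)^(1/2)). It is not the code used in this repository. The function name `frechet_distance` is hypothetical, random 64-dimensional vectors stand in for Inception features, and the matrix square root term is computed from eigenvalues, whereas common FID implementations typically call `scipy.linalg.sqrtm`. Both sets are drawn from the same distribution, so the true distance is 0 and whatever is left comes from estimating the statistics with a finite number of samples.

```python
import numpy as np


def frechet_distance(feats_a, feats_b):
    """Squared 2-Wasserstein distance between Gaussians fitted to two feature sets.

    Hypothetical helper for illustration; each input has shape (n_samples, dim).
    """
    mu_a, mu_b = feats_a.mean(axis=0), feats_b.mean(axis=0)
    cov_a = np.cov(feats_a, rowvar=False)
    cov_b = np.cov(feats_b, rowvar=False)
    # Tr((cov_a cov_b)^(1/2)) is the sum of the square roots of the
    # eigenvalues of cov_a @ cov_b, which are real and non-negative for
    # covariance matrices (up to numerical noise, hence .real and clip).
    eigvals = np.linalg.eigvals(cov_a @ cov_b).real
    tr_sqrt = np.sqrt(np.clip(eigvals, 0.0, None)).sum()
    mean_term = np.sum((mu_a - mu_b) ** 2)
    return float(mean_term + np.trace(cov_a) + np.trace(cov_b) - 2.0 * tr_sqrt)


rng = np.random.default_rng(0)
dim = 64  # assumption: small stand-in for the Inception feature dimension
real = rng.standard_normal((20_000, dim))

scores = {}
for num_gen in (200, 2_000, 20_000):
    # "Generated" features come from the same distribution as the real ones.
    fake = rng.standard_normal((num_gen, dim))
    scores[num_gen] = frechet_distance(real, fake)
    print(f"num_gen={num_gen:>6}  distance={scores[num_gen]:.4f}")
```

In this toy setting the printed distance shrinks toward 0 as num_gen grows, even though the two distributions are identical, which is the same effect seen when changing num_gen for a real model.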

The process does indeed take about 35 minutes on a single A100. Increasing the batch size and the number of GPUs can reduce this time accordingly.

Is it normal that FID is affected by the num_gen option? #111