Hi, I'm once again amazed by your great work.
However, there seem to be some issues with testing the FID in the four scenarios.
According to GANwriting's description,
first, a synthetic dataset is generated for each author wid_i from in-vocabulary and out-of-vocabulary words;
then the FID is computed against the real dataset of that same author wid_i;
and finally, the per-author FIDs are averaged.
My understanding is as follows:
Two dictionaries: in-vocabulary dictionary (114 words) and out-of-vocabulary dictionary (400 words).
Five hundred authors: seen styles (339, train) and unseen styles (161, test).
Synthetic dataset: a total of 1000 samples, 339 * 2 + 161 * 2.
However, the results I obtained differ significantly from the levels reported by VATr (a delta of over 50).
Is my understanding correct, or is there something I'm missing?
Or should I instead compute a single FID for each of the four scenarios using the code provided by GANwriting, without the per-author "calculate separately, then average" step?
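For reference, here is a minimal sketch of the per-author "compute FID separately, then average" protocol as I understand it. This is a toy illustration on random feature vectors with a plain-NumPy FID (Fréchet distance between Gaussians fitted to feature sets), not GANwriting's or VATr's actual evaluation code; the author count and feature dimension are made up:

```python
import numpy as np

def fid(feats_real: np.ndarray, feats_fake: np.ndarray) -> float:
    """Fréchet distance between Gaussians fitted to two feature sets.

    FID = ||mu1 - mu2||^2 + Tr(C1 + C2 - 2 * sqrt(C1 @ C2)).
    Tr(sqrt(C1 @ C2)) is computed via eigenvalues to avoid a matrix sqrt.
    """
    mu1, mu2 = feats_real.mean(axis=0), feats_fake.mean(axis=0)
    c1 = np.cov(feats_real, rowvar=False)
    c2 = np.cov(feats_fake, rowvar=False)
    # Eigenvalues of C1 @ C2 can be slightly negative/complex numerically;
    # keep the non-negative real parts before taking square roots.
    eigs = np.linalg.eigvals(c1 @ c2).real
    trace_sqrt = np.sum(np.sqrt(np.clip(eigs, 0.0, None)))
    return float(np.sum((mu1 - mu2) ** 2)
                 + np.trace(c1) + np.trace(c2) - 2.0 * trace_sqrt)

# Per-author protocol: FID(real_i, synthetic_i) for each author wid_i,
# then the mean over authors (toy: 3 authors, 64-dim features).
rng = np.random.default_rng(0)
per_author = []
for wid in range(3):
    real_feats = rng.normal(size=(200, 64))                 # stand-in for author wid's real images
    fake_feats = rng.normal(loc=0.1, size=(200, 64))        # stand-in for synthetic images of wid
    per_author.append(fid(real_feats, fake_feats))
avg_fid = float(np.mean(per_author))
```

In the real setting the feature vectors would come from an InceptionV3 backbone, and this loop would run once per scenario (IV/OOV words crossed with seen/unseen styles), which is exactly where my question about "per-author averaging vs. one pooled FID per scenario" arises.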
Looking forward to your reply! 😄