Closed · GuCheng123 closed this 4 months ago
Hi, we use ADM’s TensorFlow evaluation suite (link) for evaluation. After you have the generated images, convert them into a single .npz file (only ImageNet requires this manual step; the other datasets produce it automatically) and then run the evaluation. The conversion script is as follows:
import os

import numpy as np
from PIL import Image
from tqdm import tqdm


def create_npz_from_sample_folder(sample_dir, npz_path, num=50_000):
    """
    Builds a single .npz file from a folder of .png samples.
    """
    samples = []
    # Sort for a deterministic order and take at most `num` samples.
    files = sorted(os.listdir(sample_dir))[:num]
    for name in tqdm(files, desc="Building .npz file from samples"):
        sample_pil = Image.open(os.path.join(sample_dir, name))
        sample_np = np.asarray(sample_pil).astype(np.uint8)
        samples.append(sample_np)
    samples = np.stack(samples)  # (N, H, W, 3)
    np.savez(npz_path, arr_0=samples)
    print(f"Saved .npz file to {npz_path} [shape={samples.shape}].")
    return npz_path
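To make the conversion concrete, here is a small self-contained sketch of the same pipeline: it writes a few dummy 256×256 PNGs into a temporary folder (purely illustrative data, not real samples), packs them into an .npz under the key `arr_0` exactly as the script above does, and reads the batch back to verify the shape.

```python
import os
import tempfile

import numpy as np
from PIL import Image

# Hypothetical demo folder with a few dummy 256x256 RGB "samples".
sample_dir = tempfile.mkdtemp()
for i in range(4):
    Image.fromarray(
        np.random.randint(0, 256, (256, 256, 3), dtype=np.uint8)
    ).save(os.path.join(sample_dir, f"{i:06d}.png"))

# Pack the folder into a single .npz, same layout as the script above.
samples = []
for name in sorted(os.listdir(sample_dir)):
    arr = np.asarray(Image.open(os.path.join(sample_dir, name))).astype(np.uint8)
    samples.append(arr)
samples = np.stack(samples)  # (N, H, W, 3)

npz_path = os.path.join(sample_dir, "samples.npz")
np.savez(npz_path, arr_0=samples)

# The evaluation suite reads the batch back under the key "arr_0".
loaded = np.load(npz_path)["arr_0"]
print(loaded.shape)  # (4, 256, 256, 3)
```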
My testing flow is as follows: for the class-conditional setting, we generate images for all 1000 classes with the class-conditional diffusion model, then randomly sample images from the same 1000 classes of ImageNet as the reference set, resize and center-crop both to 256×256, and finally measure the metrics with torch_fidelity. But the results differ substantially from your paper. Is my testing process wrong? How do you compute these metrics?
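One common source of mismatch in such comparisons is the reference-image preprocessing. As a sketch (reconstructed from memory of ADM/guided-diffusion's `center_crop_arr`, so treat the details as an assumption, not the authoritative implementation), a resize-then-center-crop to 256×256 looks roughly like this:

```python
import numpy as np
from PIL import Image


def center_crop_arr(pil_image, image_size):
    """Resize so the shorter side equals image_size, then center-crop.

    Sketch in the style of guided-diffusion's preprocessing (assumption).
    """
    # Repeatedly halve very large images with BOX filtering to limit aliasing.
    while min(*pil_image.size) >= 2 * image_size:
        pil_image = pil_image.resize(
            tuple(x // 2 for x in pil_image.size), resample=Image.BOX
        )
    # Scale so the shorter side matches image_size.
    scale = image_size / min(*pil_image.size)
    pil_image = pil_image.resize(
        tuple(round(x * scale) for x in pil_image.size), resample=Image.BICUBIC
    )
    # Crop the central image_size x image_size patch.
    arr = np.array(pil_image)
    crop_y = (arr.shape[0] - image_size) // 2
    crop_x = (arr.shape[1] - image_size) // 2
    return arr[crop_y : crop_y + image_size, crop_x : crop_x + image_size]


# e.g. a 300x500 image becomes a (256, 256, 3) array
out = center_crop_arr(Image.new("RGB", (300, 500)), 256)
print(out.shape)  # (256, 256, 3)
```

If your crop strategy, resampling filter, or reference set (e.g. 1000 random images rather than the evaluator's full reference batch) differs from what the paper used, FID-style metrics can shift noticeably.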