The VQ-VAE's _encode_stage_2inputs function returns the quantised latent representations. Since _decode_stage_2outputs quantises its input anyway, we could add a flag to _encode_stage_2inputs that lets the user train an LDM on non-quantised representations.
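To make the proposal concrete, here is a minimal sketch of what such a flag might look like. It uses a toy, pure-Python stand-in rather than the real VQ-VAE: the class ToyVQVAE, its encode/quantize/decode helpers, and the flag name `quantized` are all assumptions made for illustration. Only the two stage-2 method names come from the description above.

```python
class ToyVQVAE:
    """Toy stand-in for a VQ-VAE (hypothetical; not the real model)."""

    def __init__(self, codebook):
        # Codebook: a list of code vectors, each a list of floats.
        self.codebook = codebook

    def encode(self, x):
        # Placeholder encoder; a real model would apply a learned network.
        return [0.5 * v for v in x]

    def quantize(self, z):
        # Return a copy of the nearest codebook entry (squared Euclidean distance).
        nearest = min(
            self.codebook,
            key=lambda c: sum((a - b) ** 2 for a, b in zip(z, c)),
        )
        return list(nearest)

    def decode(self, z_q):
        # Placeholder decoder; a real model would apply a learned network.
        return [2.0 * v for v in z_q]

    def _encode_stage_2inputs(self, x, quantized=True):
        # `quantized` is the proposed (hypothetical) flag. The default keeps
        # the current behaviour of returning quantised latents.
        z = self.encode(x)
        return self.quantize(z) if quantized else z

    def _decode_stage_2outputs(self, z):
        # Quantises its input before decoding, so it accepts either kind of latent.
        return self.decode(self.quantize(z))


model = ToyVQVAE(codebook=[[0.0, 0.0], [1.0, 1.0]])
x = [1.6, 2.2]

z_q = model._encode_stage_2inputs(x)                     # quantised latents
z = model._encode_stage_2inputs(x, quantized=False)      # continuous latents

recon_from_z_q = model._decode_stage_2outputs(z_q)
recon_from_z = model._decode_stage_2outputs(z)
```

In this toy setup both latents decode to the same output, because _decode_stage_2outputs snaps the continuous latent to the same codebook entry. Defaulting the flag to the quantised path would leave existing callers unaffected.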