Open zideliu opened 1 year ago
num_class of my dataset is 15
Did you find any solution to this? My guess since the mapping file (of shape [embed_dim] which is 1D ) consists of indices [0,2887] arranged in an order to map the latent variable to codebook entry. And author here is not using GumbleQuantization, but is using mapping to quantize vectors. So probably commenting out the quantization part that uses the mapping file and use the learnt Gumble quantization model using quant.emb.weights of the trained VQGAN model
Any ideas to solve this problem?
I trained taming-transformers on my own data set and got the ckpt file and the corresponding yaml file. When I apply it to vq-diffusion, an error will be reported. I followed
configs/imagenet.yaml
. , only the ckpt file path and the corresponding yaml file path are replaced.I feel that some parameters need to be adjusted accordingly, but due to personal ability problems, I have not debugged it. My personal suspicion is that
help_folder/statistics/taming_vqvae_974.pt
may be different from the parameters I used to train taming-transformers. If you can provide training ifhq dataset details, I would be greatly appreciated.configs/mydataset.yaml
OUTPUT/pretrained_model/mydataset/mydataset.yaml