Regarding the description in the paper, "During inference, we use the CFG scale of 4 and 5 for M-Transformer and R-Transformer on HumanML3D, and (2, 5) on KIT-ML." I am not quite clear on how to set different CFG scale values for the two transformers during the inference process.
Regarding the description in the paper, "During inference, we use the CFG scale of 4 and 5 for M-Transformer and R-Transformer on HumanML3D, and (2, 5) on KIT-ML." I am not quite clear on how to set different CFG scale values for the two transformers during the inference process.