Closed gobigrassland closed 4 months ago
我看到用到的Unet模型参数与SD1.4模型配置参数,就是其中cross_attention_dim和in_channels的区别。 (1)唇语模型UNet: cross_attention_dim=384, in_channels=8 (2)SD1.4 UNet: cross_attention_dim=768, in_channels=4
是从随机初始化开始训练的
我看到用到的Unet模型参数与SD1.4模型配置参数,就是其中cross_attention_dim和in_channels的区别。 (1)唇语模型UNet: cross_attention_dim=384, in_channels=8 (2)SD1.4 UNet: cross_attention_dim=768, in_channels=4