Open haibao-yu opened 1 month ago
We follow the implementation of ControlNet, where the control signal (map) is first added with x (noisy latent) and then goes through the copied encoder block. The whole file you referred to is the encoder from Figure 3 (not just the conv block).
This issue is stale because it has been open for 7 days with no activity. If you do not have any follow-ups, the issue will be closed soon.
Thanks for your great work! I have a question about your implementation:
https://github.com/cure-lab/MagicDrive/blob/6038a24de9fc4df641a9775aab3f735a3c590589/magicdrive/networks/unet_addon_rawbox.py#L743-L850