cure-lab / MagicDrive

[ICLR24] Official implementation of the paper “MagicDrive: Street View Generation with Diverse 3D Geometry Control”
https://gaoruiyuan.com/magicdrive/
GNU Affero General Public License v3.0
419 stars 22 forks source link

question about view consistentence #3

Closed haofengsiji closed 6 months ago

haofengsiji commented 6 months ago

Does the term "hidden state of neighbor" refer to the latent representation of the neighboring image or the scene-level encoding embedding? Additionally, when addressing view consistency, does the model output multiple-view results simultaneously or only one view?

flymin commented 6 months ago
  1. it means "the latent representation of the neighboring image"
  2. multi-view images are generated simultaneously. The model parameters are shared for different views.