tangjiapeng / DiffuScene

[CVPR 2024] DiffuScene: Denoising Diffusion Models for Generative Indoor Scene Synthesis

Combine or Concatenate encoded attribute features? #26

Open Starry-lei opened 4 months ago

Starry-lei commented 4 months ago

Hi, it looks like the code combines the encoded attribute features by summing them, rather than concatenating them as illustrated in the paper. Is that right?

    if self.seperate_all:
        x_class = self.class_embedf(x[:, self.bbox_dim:self.bbox_dim+self.class_dim, :])
        if self.objectness_dim > 0:
            x_object = self.objectness_embedf(x[:, self.bbox_dim+self.class_dim:self.bbox_dim+self.class_dim+self.objectness_dim, :])
        else:
            x_object = 0

        if self.objfeat_dim > 0:
            x_objfeat = self.objfeat_embedf(x[:, self.bbox_dim+self.class_dim+self.objectness_dim:self.bbox_dim+self.class_dim+self.objectness_dim+self.objfeat_dim, :])
        else:
            x_objfeat = 0

        x_bbox = self.bbox_embedf(x[:, 0:self.bbox_dim, :])
        x = x_class + x_bbox + x_object + x_objfeat
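
To make the distinction concrete, here is a minimal sketch of the two ways of combining features. It is a toy illustration and not the repository's code: plain Python lists stand in for tensors, the helper names `combine_by_sum` and `combine_by_concat` are hypothetical, and the feature values are made up.

```python
# Toy illustration only: plain lists stand in for tensors, and the
# per-attribute embeddings below are made-up values, not DiffuScene outputs.

def combine_by_sum(features):
    """Element-wise sum: all features must share one width, which is kept."""
    widths = {len(f) for f in features}
    if len(widths) != 1:
        raise ValueError("summation requires equal-width features")
    return [sum(channel) for channel in zip(*features)]


def combine_by_concat(features):
    """Concatenation: widths may differ; the output width is their total."""
    return [value for f in features for value in f]


# Hypothetical embedded attributes for a single object, each of width 4.
x_class = [0.1, 0.2, 0.3, 0.4]
x_bbox = [1.0, 1.0, 1.0, 1.0]
x_objfeat = [0.0, 0.5, 0.0, 0.5]

summed = combine_by_sum([x_class, x_bbox, x_objfeat])
concatenated = combine_by_concat([x_class, x_bbox, x_objfeat])

print(len(summed))        # width stays 4
print(len(concatenated))  # width grows to 12
```

With a sum, every attribute embedding has to share the same channel width and the result keeps that width. This is also why the scalar `0` fallbacks in the snippet above work when an attribute is disabled. With concatenation, the output width is the total of the individual widths, so a scalar placeholder would not fit.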