Vchitect / Latte

Latte: Latent Diffusion Transformer for Video Generation.
Apache License 2.0

Discriminative tasks #32

Open bhack opened 4 months ago

bhack commented 4 months ago

Have you tested your architecture on discriminative tasks such as video or panoptic segmentation? There has been some promising recent work, but only on images: https://github.com/cp3wan/DFormer

maxin-cn commented 4 months ago

Thank you for your interest. We have mainly tested our architecture on video generation; we have not tested it on video or panoptic segmentation so far.