Exploring alignment with multiple models?

sihyun-yu / REPA

Official Pytorch Implementation of Representation Alignment for Generation: Training Diffusion Transformers Is Easier Than You Think

MIT License

663 stars 30 forks source link

Hi @sihyun-yu ,

Great work!

I've also been working on taking advantage of pre-trained visual representations to supervise a model for a different task. Here is our work Theia to improve visual representation for robot learning. I also want to mention NVIDIA's work RADIO, which solves vision problems.

Both approaches use multiple teacher models and we found this improves the representation a lot. I also noticed that you have implemented the multi-teacher training in your codebase. Did you explore this aspect and would you mind sharing your observations if you had?

sihyun-yu / REPA

Exploring alignment with multiple models? #8