minyoungg / platonic-rep

382 stars 19 forks source link

Does the author try diffusion models when aligning with the language model? #5

Open MaureenZOU opened 1 week ago

MaureenZOU commented 1 week ago

As for the question, I am very curious about the vision encoder with diffusion models, and how does this align with the semantic world.

dribnet commented 1 week ago

IIUC: this recent paper from @yossigandelsman and others claims that the weight space of diffusion models also has interpretable latent spaces and so perhaps these could also be tested for alignment as you suggest.