huggingface / diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
https://huggingface.co/docs/diffusers
Apache License 2.0

Add support for Meissonic #9794

Open viiika opened 4 weeks ago

viiika commented 4 weeks ago

Meissonic is a non-autoregressive, masked image modeling (MIM) text-to-image synthesis model that can generate high-resolution images. It is designed to run on consumer graphics cards.
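For readers unfamiliar with the term, here is a minimal sketch of non-autoregressive masked-token decoding in the MaskGIT style, assuming a cosine unmasking schedule. It is not Meissonic's code: `toy_predict` is a hypothetical random stand-in for the transformer, and the names `MASK`, `cosine_mask_ratio`, and `parallel_decode` are illustrative assumptions. It only shows the general idea of starting from a fully masked token grid and revealing the most confident predictions in parallel over a few steps.

```python
import math
import random

MASK = -1  # illustrative sentinel for a token that is still masked


def cosine_mask_ratio(progress: float) -> float:
    """Fraction of tokens left masked once `progress` (0..1) of the steps are done."""
    return math.cos(0.5 * math.pi * progress)


def toy_predict(tokens, vocab_size, rng):
    """Hypothetical stand-in for the transformer.

    Returns one (token_id, confidence) pair per position. A real model would
    condition on the text prompt and on the tokens already revealed.
    """
    return [(rng.randrange(vocab_size), rng.random()) for _ in tokens]


def parallel_decode(seq_len=16, vocab_size=8, steps=4, seed=0):
    """Reveal all tokens in `steps` parallel passes instead of one token at a time."""
    rng = random.Random(seed)
    tokens = [MASK] * seq_len
    masked_counts = []
    for step in range(1, steps + 1):
        preds = toy_predict(tokens, vocab_size, rng)
        # How many tokens the schedule says should still be masked after this step.
        keep_masked = math.floor(seq_len * cosine_mask_ratio(step / steps))
        masked_positions = [i for i, t in enumerate(tokens) if t == MASK]
        # Commit the most confident predictions first.
        masked_positions.sort(key=lambda i: preds[i][1], reverse=True)
        n_reveal = max(0, len(masked_positions) - keep_masked)
        for i in masked_positions[:n_reveal]:
            tokens[i] = preds[i][0]
        masked_counts.append(sum(1 for t in tokens if t == MASK))
    return tokens, masked_counts


if __name__ == "__main__":
    final_tokens, history = parallel_decode()
    print(history)       # masked tokens remaining after each step
    print(final_tokens)  # fully revealed token ids
```

With the defaults (16 tokens, 4 steps), the number of masked tokens remaining after each step is 14, 11, 6, 0, so few tokens are committed early and most are filled in during the later steps.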

Model checkpoint: https://huggingface.co/MeissonFlow/Meissonic
Inference code: https://github.com/viiika/Meissonic
Paper: https://arxiv.org/abs/2410.08261

Anyone interested in implementing this?

protimroy commented 3 weeks ago

What kind of support are you looking for?

viiika commented 3 weeks ago

> What kind of support are you looking for?

Just to integrate the related pipeline, scheduler, and transformer backbone into diffusers, so that anyone who wants to run inference with Meissonic can simply import diffusers instead of cloning the source code.

protimroy commented 3 weeks ago

I can attempt (:

viiika commented 3 weeks ago

> I can attempt (:

Wow, thanks a lot! Feel free to add your name to the first line of the copyright notice if you’d like. If you need further assistance or clarification, please let me know.

viiika commented 3 weeks ago

> I can attempt (:

I submitted the relevant code in https://github.com/huggingface/diffusers/pull/9875.

protimroy commented 3 weeks ago

ok