jy0205 / Pyramid-Flow

Code of Pyramidal Flow Matching for Efficient Video Generative Modeling
https://pyramid-flow.github.io/
MIT License
2.43k stars 238 forks source link

Image generated video can only be selected at 384p #172

Open wx331406 opened 2 weeks ago

wx331406 commented 2 weeks ago

Why I can't select text image to video selection 1024p, only default selection 384p

feifeiobama commented 2 weeks ago

This is because we have only uploaded 1024p image and 384p video checkpoints for pyramid-miniflux. The 768p video checkpoint of pyramid-miniflux will be released very soon.

wx331406 commented 2 weeks ago

Sorry, maybe I'm not clear enough, I'm using the pyramidal-miniflux model to generate video for images, when I open the browser interface I found that the image to video can only select 384, and there is no 1024 image to video option, is my startup is not right or do I need to modify the code to be able to do it? Thank you very much for your patience.

feifeiobama commented 2 weeks ago

Sorry, we have not updated the Gradio demo to include text-to-image generation. Please use this Jupyter notebook to test text-to-image capability.

wx331406 commented 2 weeks ago

Thank you very much for your patience!