Paddle Multimodal Integration and eXploration, supporting mainstream multi-modal tasks, including end-to-end large-scale multi-modal pretrain models and diffusion model toolbox. Equipped with high performance and flexibility.
Apache License 2.0
359
stars
143
forks
source link
【PPMix No.4】 support LLaVA-OneVision and LLaVA-Critic, refine llava codes #796
Thanks for your contribution!