monatis / lmm.cpp

Inference of Large Multimodal Models in C/C++. LLaVA and others
MIT License
46 stars 2 forks source link

WIP: Infer with LLaVA-RLHF #2

Open monatis opened 11 months ago

monatis commented 11 months ago

This is still WIP

After I implemented the GGUF support in clip.cpp, now it's time to combine clip.cpp + llama.cpp = llava.cpp (the first model to be supported in this repo).

For now, I copy CLIP conversion + model loading + inference code from clip.cpp and make necessary changes. In the future, these changes may be merged upstream and clip.cpp may be a submodule in this repo.

I think This is enough for the initial release. I will streamline the implementation afterwards.