h2oai / h2ogpt

Private chat with local GPT with document, images, video, etc. 100% private, Apache 2.0. Supports oLLaMa, Mixtral, llama.cpp, and more. Demo: https://gpt.h2o.ai/ https://gpt-docs.h2o.ai/
http://h2o.ai
Apache License 2.0
11.4k stars 1.25k forks source link

Visual Model integration #774

Closed pseudotensor closed 10 months ago

pseudotensor commented 1 year ago

llama-2 w/ vision: https://github.com/haotian-liu/LLaVA https://www.youtube.com/watch?v=RxBSmbdJ1I8

https://minigpt-4.github.io/ https://huggingface.co/spaces/Vision-CAIR/minigpt4

pseudotensor commented 1 year ago

https://github.com/rom1504/img2dataset

https://arxiv.org/abs/2304.10592 https://minigpt-4.github.io/ https://github.com/Vision-CAIR/MiniGPT-4/blob/main/dataset/README_1_STAGE.md https://github.com/Vision-CAIR/MiniGPT-4 https://www.youtube.com/watch?v=__tftoxpBAw https://huggingface.co/datasets/Vision-CAIR/cc_sbu_align https://huggingface.co/Vision-CAIR/MiniGPT-4 https://huggingface.co/spaces/Vision-CAIR/minigpt4

https://github.com/salesforce/LAVIS/tree/main/projects/blip2

pseudotensor commented 1 year ago

https://github.com/turingmotors/heron/tree/main

pseudotensor commented 1 year ago

https://github.com/NExT-GPT/NExT-GPT

pseudotensor commented 12 months ago

https://github.com/voxel51/voxelgpt/tree/main

pseudotensor commented 12 months ago

https://github.com/langchain-ai/langchain/blob/master/cookbook/Semi_structured_multi_modal_RAG_LLaMA2.ipynb

pseudotensor commented 12 months ago

https://github.com/gradio-app/gradio/issues/5055 https://llava.hliu.cc/

pseudotensor commented 11 months ago

https://github.com/danny-avila/LibreChat

pseudotensor commented 11 months ago

https://github.com/THUDM/CogVLM

pseudotensor commented 10 months ago

done