-
Thanks for your excellent work :)
We admire the potential of vision-language models with unified vocabularies, and would appreciate it if some templates of the training code could be released for academic experiments…
-
This is an issue to collect requests for model abliterations.
No one is required to fulfill your request, but this is a good place to check whether someone else has already applied this process to the…
-
### Model description
A new large language and vision model (LLVM) that uses auxiliary visual information and natural language for prediction.
It uses two modules: MoAI-Compressor and MoAI-Mixer. He…
-
Hi,
Congrats on the impressive work! Our paper *FAITHSCORE: Evaluating Hallucinations in Large Vision-Language Models* is closely related to your topic.
I wonder if you would mind adding our work to your…
-
```shell
CUDA_VISIBLE_DEVICES=0 python3 inference.py --model-path ./PCIResearch/TransCore-M --vision-path ./openai/clip-vit-large-patch14-336
```
You are using a model of type transcorem to instantiate a model of…
-
## LINKs
[paper](https://arxiv.org/abs/2405.02246)
[models](https://huggingface.co/HuggingFaceM4/idefics2-8b)…
-
```
in load_pretrained_model
    model = CambrianLlamaForCausalLM.from_pretrained(
  File "/usr/local/lib/python3.10/dist-packages/transformers/modeling_utils.py", line 3531, in from_pretrained
) =…
```
-
Thanks for your excellent work. Could you provide the code for running the robot in a real-world environment?
-
- [ ] [Title: "Yi Model Family: Powerful Multi-Dimensional Language and Multimodal Models"](https://arxiv.org/html/2403.04652v1)
-
Hi @wondervictor, a huge shout-out for your remarkable contributions!
I've seamlessly integrated YOLO-World into [X-AnyLabeling](https://github.com/CVHub520/X-AnyLabeling), marking a significant ad…