-
Is there a way to train novel concepts into your BLIP model, the way textual inversions work for Stable Diffusion image generation? If so, is there a training script provided, or would one nee…
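For intuition: textual inversion keeps every model weight frozen and learns only the embedding vector of one new placeholder token. Whether BLIP exposes hooks for this is not confirmed here; the sketch below is a framework-free illustration of the core idea, with all names and the toy objective purely illustrative.

```python
# Hedged, framework-free sketch of textual-inversion-style concept learning:
# all existing embedding rows stay frozen; only the new token's vector is
# updated by gradient descent toward a stand-in target objective.

def train_new_token(embeddings, new_id, target, lr=0.1, steps=200):
    """embeddings: dict token_id -> vector (list of floats).
    Only embeddings[new_id] is updated; every other row is frozen."""
    e = list(embeddings[new_id])
    for _ in range(steps):
        # gradient of mean squared error ||e - target||^2 / d w.r.t. e
        grad = [2 * (ei - ti) / len(e) for ei, ti in zip(e, target)]
        e = [ei - lr * gi for ei, gi in zip(e, grad)]
    embeddings[new_id] = e
    return embeddings

# token 2 is the new concept; tokens 0 and 1 are pre-existing (frozen)
emb = {0: [1.0, 0.0], 1: [0.0, 1.0], 2: [0.0, 0.0]}
train_new_token(emb, 2, target=[0.5, -0.5])
print(emb[0])  # frozen row, unchanged
print(emb[2])  # new row, pulled toward the target
```

In a real setup the "target" would be replaced by an image-conditioned loss, but the optimization structure (one trainable embedding row, everything else frozen) is the same.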
-
Subscribe to this issue and stay notified about new [daily trending repos in Jupyter Notebook](https://github.com/trending/jupyter-notebook?since=daily).
-
### Describe the issue
Issue/Error:
Loading 1.5 models works fine, but loading 1.6 models yields the error below. Note that the 1.6 models do load (despite the error) and inference works. However, tr…
-
## Problem statement
1. Despite the impressive capabilities of large-scale language models, their potential in modalities other than text has not been fully demonstrated.
2. Aligning parameters of vi…
-
- [ ] [LLaVA/README.md at main · haotian-liu/LLaVA](https://github.com/haotian-liu/LLaVA/blob/main/README.md?plain=1)
## 🌋 LLaVA: Large Language and Vi…
-
- I am trying to run inference with Cambrian-1-34B.
- I have RTX 6000 GPUs with 48 GB of VRAM each.
- I am following [this inference script](https://github.com/cambrian-mllm/cambrian/blob/main/inference.py).
The…
-
I just followed the steps, but when I run the following code:

```python
# Load model directly
from transformers import AutoModel
model = AutoModel.from_pretrained("Efficient-Large-Model/Llama-3-VILA1.5-8B")
```
…
-
It seems GPT-style models like LLaMA-2 are more popular, but the paper still uses T5.
Compared to GPT, does T5 have any particular advantages?
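One concrete architectural difference behind this question: T5 is an encoder-decoder model whose encoder attends bidirectionally over the input, while GPT/LLaMA-style models are decoder-only and use a causal mask, so each token sees only earlier positions. A minimal sketch of the two attention-mask patterns:

```python
# 1 = query position i may attend to key position j, 0 = masked out

def causal_mask(n):
    """Decoder-only (GPT/LLaMA-style): token i attends only to j <= i."""
    return [[1 if j <= i else 0 for j in range(n)] for i in range(n)]

def bidirectional_mask(n):
    """T5 encoder: every token attends to the whole input."""
    return [[1] * n for _ in range(n)]

print(causal_mask(3))         # [[1, 0, 0], [1, 1, 0], [1, 1, 1]]
print(bidirectional_mask(3))  # [[1, 1, 1], [1, 1, 1], [1, 1, 1]]
```

Bidirectional encoding can help tasks that condition on a full input (e.g. an image caption or a question), which may be one reason an encoder-decoder was kept; this is a structural observation, not a claim about the paper's motivation.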
-
### 🚀 Feature
NeMo's NeVa (LLaVA) is a multimodal language model.
Initial `examine`:
`Found 49 distinct operations, of which 39 (79.6%) are supported`
### Work items
- #145 (but looks like #…
-
How to fine-tune a large vision-language model such as LLaVA on the generated prompts only? The current [code](https://github.com/huggingface/trl/blob/main/examples/scripts/vsft_llava.py) is fine-tuni…
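The usual mechanism for training on only part of each sequence is label masking: set the label of every token you do not want to learn from to -100, which cross-entropy losses in PyTorch-style frameworks ignore (TRL's `DataCollatorForCompletionOnlyLM` applies this idea to train on completions only). A framework-free sketch of the masking step, with illustrative token ids:

```python
IGNORE_INDEX = -100  # label value ignored by PyTorch-style cross-entropy

def mask_before_template(input_ids, response_template_ids):
    """Return labels where everything up to and including the first match of
    the template is IGNORE_INDEX, so loss falls only on the tokens after it."""
    n = len(response_template_ids)
    for start in range(len(input_ids) - n + 1):
        if input_ids[start:start + n] == response_template_ids:
            cut = start + n
            return [IGNORE_INDEX] * cut + list(input_ids[cut:])
    # template absent: mask the whole sequence rather than train on garbage
    return [IGNORE_INDEX] * len(input_ids)

# toy ids: [7, 8] stands in for a tokenized "ASSISTANT:" marker
print(mask_before_template([1, 2, 3, 7, 8, 9, 10], [7, 8]))
# → [-100, -100, -100, -100, -100, 9, 10]
```

To restrict the loss to the prompt side instead, the same trick applies with the mask inverted (keep labels before the marker, set everything after it to -100).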