-
Hello, thanks for your nice work!
I am now having trouble reproducing the reported score on the VQA task. I evaluated the checkpoint downloaded from https://storage.googleapis.com/sfr-vision-languag…
-
### Feature request
How can we take advantage of https://github.com/haotian-liu/LLaVA ?
https://llava-vl.github.io/
### Motivation
> LLaVA represents a novel end-to-end trained large multimodal …
-
### Describe the bug
Hello, I'm trying to get the recently published [MultiAgent](https://github.com/microsoft/autogen/tree/gaia_multiagent_v01_march_1st/samples/tools/autogenbench/scenarios/GAIA/Tem…
-
### Model description
LaVIN is a vision-language instructed model that is affordable to train (it was trained in a few hours on 8 A100 GPUs) with good performance on ScienceQA.
I'd like to add …
-
`def convnextv2_large(pretrained=False, **kwargs) -> ConvNeXt:`
/root/autodl-tmp/hongan/NewFolder/LLaVA-HR/llava/model/multimodal_encoder/convnext.py:1091: UserWarning: Overwriting convnextv2_huge in r…
-
**Weekend Task**
- Research the theory behind Stable Diffusion
- List and research the applications of Stable Diffusion
- Expand on the application.
- Which industry does this affect?
…
-
Subscribe to this issue and stay notified about new [daily trending repos in unknown languages](https://github.com/trending/unknown?since=daily)!
-
**Is your feature request related to a problem? Please describe.**
I currently run the encoder ONNX model, get the features, then prepare inputs such as `input_ids` and pass them to a separate decoder ONNX model multiple times.…
-
- [ ] [LoRA Land: Fine-Tuned Open-Source LLMs that Outperform GPT-4 - Predibase - Predibase](https://predibase.com/blog/lora-land-fine-tuned-open-source-llms-that-outperform-gpt-4)
# LoRA Land: Fine…
-
- I am trying to run inference with Cambrian-1-34B.
- I have RTX 6000 GPUs with 48 GB of memory each.
- I am following [this inference script](https://github.com/cambrian-mllm/cambrian/blob/main/inference.py).
The…