-
### Question
Here is my training script:
```shell
torchrun --nnodes=1 --nproc_per_node=8 --master_port=25001 \
llava/train/train_mem.py \
--model_name_or_path llama-vicuna-7b-v1.1 \
…
-
### Describe the issue
Issue:
Thanks to your contribution to the work! I met some issues while training.
I want to finetune the llava-llama2-7b-chat model on custom data. I used a two step finetuni…
-
**Describe the bug**
when train [llama-vid](https://github.com/dvlab-research/LLaMA-VID) (stage2, full-finetuning LLaMA) using deepspeed==0.14.0, and transformers trainer, grad_norm will be nan (or 1…
-
What can this application actually do more than the browser-based version of GPT.
Where there is an advantage ?
-
Hi I just saw in redis that there is a llava model based on llama-3, can be added it to the library? Thanks
Source:https://www.reddit.com/r/LocalLLaMA/comments/1ca8uxo/llavallama38b_is_released/
-
### Model description
LaVIN is a vision-language instructed model that is affordable to train (it was trained in a few hours on 8 A100 GPUs) with good performance on ScienceQA.
I'd like to add …
-
### Bug Description
Hello,
While I was trying to use the Weaviate Vector Store, I found that when I try to insert a Document with metadata to it, then it is not actually inserted into the vector s…
-
### Prerequisites
- [X] I am running the latest code. Mention the version if possible as well.
- [X] I carefully followed the [README.md](https://github.com/ggerganov/llama.cpp/blob/master/README.md)…
-
When launching Yuna, I get the following stderr output:
_Hardware accelerator e.g. GPU is available in the environment, but no `device` argument is passed to the `Pipeline` object. Model will be on C…
-
Hi 👋🏻 Do you have any inference examples that I could use?