llama-3-llava Search Results

962 results for llama-3-llava


pytorch/pytorch #102434

FSDP very slow on multi-node training

### 🐛 Describe the bug When I try to train a model using torch.distributed.FullyShardedDataParallel, I found that when training on a single node with multiple GPUs (1x8 A100), the training speed is normal.…

JulioZhao97 updated 7 months ago
28

haotian-liu/LLaVA #326

[Question] Some Questions about the datasets used and the mo…

### Question 1, In my understanding, the first pretraining stage uses either the CC-3M Concept-balanced 595K dataset or the LAION/CC/SBU BLIP-Caption Concept-balanced 558K dataset. The second stage u…

duchenzhuang updated 1 year ago
1

haotian-liu/LLaVA #348

How to load LLaVA on a server with no Internet connection?

### When did you clone our code? I cloned the code base after 5/1/23 ### Describe the issue I manually download the pre-trained model at my path, here, which click the download button for each. ![…

gnimyang updated 1 year ago
10

Significant-Gravitas/AutoGPT #15

Auto-GPT Recursive Self Improvement

## Idea 💡 The **ULTIMATE** achievement for this project would be if Auto-GPT was able to recursively improve itself. That, after-all, is how AGI is predicted by many to come about. ## Suggestion …

Torantulino updated 2 months ago
271

dvlab-research/LISA #33

loading checkpoint error

I can load checkpoint correctly if I run train_ds.py, but when I use deepspeed as the given example, this error occurs. Can you tell me how to fix it? You are using the legacy behaviour of the . This…

alpacaduby updated 1 year ago
3

haotian-liu/LLaVA #281

--model_name_or_path in the training workflow : should it b…

### When did you clone our code? I cloned the code base after 5/1/23 ### Describe the issue Issue: scripts/deepspeed/finetune_lora.sh I think in training workflow `--model_name_or_path` should not…

YerongLi updated 1 year ago
2

haotian-liu/LLaVA #24

pretrain error

![image](https://user-images.githubusercontent.com/22076188/233404474-9d0977c7-c374-4aae-b673-06fabccb0466.png)

paulpaul91 updated 1 year ago
31

haotian-liu/LLaVA #264

[Usage] llava:apply_delta is killed

### When did you clone our code? I cloned the code base before 5/1/23, but have pulled the latest code base ### Describe the issue Issue: After download done, the script of apply_delta is killed …

k1e3v1i4n updated 1 year ago
1

haotian-liu/LLaVA #252

[Usage] Floating point exception when following fine-tuning …

Thanks for the awesome repo and the exciting progress on multimodal learning. Looking forward to trying out the model and building off of it, but having some issues getting started with fine-tuning my…

jpgard updated 1 year ago
5

haotian-liu/LLaVA #223

[Usage] ImportError: cannot import name 'mkdir_exists_ok' fr…

### When did you clone our code? I cloned the code base after 5/1/23 ### Describe the issue Issue: Command: ``` torchrun --nnodes=1 --nproc_per_node=4 --master_port=25001 \ llava/train/tr…

LetsGoFir updated 1 year ago
1

