-
## 🐛 Bug
There seems to be a discrepancy (in addition to https://github.com/pytorch/xla/issues/3718) in how `torch.nn.Linear` (`torch.nn.functional.linear`) is implemented and dispatched between th…
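A minimal way to probe such a dispatch difference is to run the same `torch.nn.functional.linear` call on CPU and on an XLA device and compare results. A hedged repro sketch (assuming `torch_xla` is installed; shapes and seed are arbitrary placeholders, not from the original report):
```python
import torch
import torch.nn.functional as F
import torch_xla.core.xla_model as xm

# Compare the same F.linear on CPU vs. an XLA device.
torch.manual_seed(0)
x = torch.randn(4, 16)
w = torch.randn(8, 16)
b = torch.randn(8)

cpu_out = F.linear(x, w, b)

device = xm.xla_device()
xla_out = F.linear(x.to(device), w.to(device), b.to(device))

# An implementation/dispatch discrepancy would surface as a mismatch here.
print(torch.allclose(cpu_out, xla_out.cpu(), atol=1e-5))
```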
-
Currently, we have disabled multi-GPU support for QLoRA because we haven't tested it yet. It might be worthwhile to look into this at some point, so this issue is just a reminder to revisit it.
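For reference, one common way to run a QLoRA-style 4-bit base model across several GPUs outside this codebase is Hugging Face's `device_map` sharding. A hedged sketch, not this repository's API; the checkpoint name is a placeholder:
```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

# Sketch only: load a 4-bit base model and let Hugging Face place its
# layers across all visible GPUs.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",  # placeholder checkpoint
    quantization_config=bnb_config,
    device_map="auto",  # shards layers across available devices
)
```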
-
Currently, Unsloth only supports single-GPU training. How can it be made to work with 8-GPU training? Thanks.
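For comparison, the stock PyTorch pattern for 8-GPU data parallelism is `torchrun` plus `DistributedDataParallel`; this is a generic sketch (the model is a stand-in), not an Unsloth API:
```python
import os
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

# Generic DDP skeleton; launch with: torchrun --nproc_per_node=8 train.py
dist.init_process_group(backend="nccl")
local_rank = int(os.environ["LOCAL_RANK"])
torch.cuda.set_device(local_rank)

model = torch.nn.Linear(1024, 1024).cuda(local_rank)  # stand-in model
model = DDP(model, device_ids=[local_rank])
# ... training loop ...
dist.destroy_process_group()
```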
-
When using Axolotl, the training loss drops to 0 after the gradient accumulation steps. Is this expected behaviour?
With torchrun, the training loss consistently remains NaN.
Thank…
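Both symptoms (loss collapsing to 0, loss going NaN) are often triaged the same way: check that the loss is divided by the accumulation count and that no gradient is non-finite after `backward()`. A framework-agnostic sketch in plain PyTorch; the helper name is made up for illustration:
```python
import torch

def check_grads(model: torch.nn.Module) -> None:
    # Flag any parameter whose gradient contains NaN/Inf after backward().
    for name, param in model.named_parameters():
        if param.grad is not None and not torch.isfinite(param.grad).all():
            print(f"non-finite gradient in {name}")

# Typical accumulation loop: scale the loss by the accumulation step count
# so the accumulated gradient matches that of the larger effective batch.
# loss = criterion(outputs, targets) / accumulation_steps
# loss.backward()
# check_grads(model)
```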
-
### 🐛 Describe the bug
```python
from torch.distributed.fsdp import (
    FullyShardedDataParallel as FSDP,
    MixedPrecision,
    BackwardPrefetch,
    ShardingStrategy,
    FullStateDictConfig,
    StateDictType,
)
```
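The excerpt ends at the imports; a minimal wrap using them typically looks like the following sketch (placeholder model and dtype choices; it assumes the process group was already initialized, e.g. by `torchrun`):
```python
import torch
from torch.distributed.fsdp import (
    FullyShardedDataParallel as FSDP,
    MixedPrecision,
    BackwardPrefetch,
    ShardingStrategy,
)

# Placeholder model; reports like this usually wrap a transformer here.
model = torch.nn.Linear(1024, 1024).cuda()

fsdp_model = FSDP(
    model,
    sharding_strategy=ShardingStrategy.FULL_SHARD,
    mixed_precision=MixedPrecision(
        param_dtype=torch.bfloat16,
        reduce_dtype=torch.bfloat16,
        buffer_dtype=torch.bfloat16,
    ),
    backward_prefetch=BackwardPrefetch.BACKWARD_PRE,
)
```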
-
Hello, when running main_finetune.py and reaching line 238:
```python
for param in fsdp_ignored_parameters:
    dist.broadcast(param.data, src=dist.get_global_rank(fs_init.get_data_parallel_group(), 0),
…
```
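For context, `dist.broadcast` copies a tensor from the `src` rank to every other rank in the group. A self-contained sketch, independent of the issue's project-specific `fs_init` helper:
```python
import os
import torch
import torch.distributed as dist

# Minimal broadcast demo; launch with: torchrun --nproc_per_node=2 demo.py
dist.init_process_group(backend="nccl")
rank = dist.get_rank()
torch.cuda.set_device(int(os.environ["LOCAL_RANK"]))

t = torch.full((4,), float(rank), device="cuda")
dist.broadcast(t, src=0)  # afterwards, every rank holds rank 0's values
print(rank, t)
```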
-
### 🚀 The feature, motivation and pitch
# 🚀 Feature
Provide a detailed API design for a high-level PyTorch Tensor Parallelism API. This is an evolution of the PyTorch Sharding work introduced in ht…
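An API in this style later shipped under `torch.distributed.tensor.parallel` in recent PyTorch releases; a hedged sketch of how it is used (the toy MLP and plan below are illustrative, not from the proposal):
```python
import os
import torch
from torch.distributed.device_mesh import init_device_mesh
from torch.distributed.tensor.parallel import (
    parallelize_module,
    ColwiseParallel,
    RowwiseParallel,
)

class MLP(torch.nn.Module):
    def __init__(self):
        super().__init__()
        self.w1 = torch.nn.Linear(256, 1024)
        self.w2 = torch.nn.Linear(1024, 256)

    def forward(self, x):
        return self.w2(torch.relu(self.w1(x)))

# Launch with torchrun; one mesh dimension covering all ranks.
mesh = init_device_mesh("cuda", (int(os.environ["WORLD_SIZE"]),))

model = MLP().cuda()
# w1 is sharded column-wise and w2 row-wise, so the hidden activation
# stays sharded and only w2's output needs a collective.
model = parallelize_module(
    model, mesh, {"w1": ColwiseParallel(), "w2": RowwiseParallel()}
)
```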
-
When using:
```
torchrun --nproc_per_node=2 --master_port=20001 fastchat/train/train.py \
    --model_name_or_path lmsys/vicuna-7b-v1.5 \
    --data_path data/dummy_conversation.json \
    --bf…
```
-
I would greatly appreciate your help with this error. Here is the [tutorial](https://huggingface.co/blog/fine-tune-whisper) that I followed. Thanks in advance.
`from transformers import Seq2SeqTrai…`
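The pasted code is cut off at the import; in that tutorial the relevant imports are typically `Seq2SeqTrainingArguments` and `Seq2SeqTrainer`, so a hedged sketch of the setup stage follows (hyperparameter values are placeholders, not the asker's actual config):
```python
from transformers import Seq2SeqTrainingArguments

# Placeholder hyperparameters in the style of the Whisper fine-tuning post.
training_args = Seq2SeqTrainingArguments(
    output_dir="./whisper-finetuned",  # placeholder path
    per_device_train_batch_size=16,
    gradient_accumulation_steps=1,
    learning_rate=1e-5,
    max_steps=4000,
    fp16=True,
    evaluation_strategy="steps",
    predict_with_generate=True,
)
```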