-
FSDP is a toolkit for distributed model training and an alternative to DeepSpeed. The InstructLab team has added support for FSDP in addition to DeepSpeed in their training repo and we would like to …
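The training repo itself isn't quoted here, but for orientation, a minimal FSDP wrap in plain PyTorch looks roughly like this (the toy model and process-group setup are illustrative, not InstructLab's code):
```python
# Minimal FSDP sketch; run under torchrun so the process-group env vars exist.
import torch
import torch.distributed as dist
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP

dist.init_process_group("nccl")
torch.cuda.set_device(dist.get_rank() % torch.cuda.device_count())

model = torch.nn.Sequential(
    torch.nn.Linear(1024, 4096), torch.nn.ReLU(), torch.nn.Linear(4096, 1024)
).cuda()

# FSDP shards parameters, gradients, and optimizer state across ranks,
# gathering full parameters only around each wrapped module's forward/backward.
model = FSDP(model)
optimizer = torch.optim.AdamW(model.parameters(), lr=1e-4)
```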
-
```
7: [rank80]: urllib3.exceptions.ReadTimeoutError: HTTPSConnectionPool(host='huggingface.co', port=443): Read timed out. (read timeout=10)
```
Running the FSDP example on 16 p5 nodes. The example w…
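The 10-second read timeout matches huggingface_hub's default metadata timeout, so with 16 nodes hitting the Hub at startup this is plausibly a network/rate-limit issue rather than a training bug. A hedged workaround sketch (the model id is a placeholder, not from the report):
```python
# Pre-download once (e.g. from rank 0 or a setup job), then train offline.
import os
from huggingface_hub import snapshot_download

snapshot_download("meta-llama/Meta-Llama-3-8B")  # populates the local cache

os.environ["HF_HUB_OFFLINE"] = "1"   # subsequent loads read the cache only
# Alternatively, raise the metadata timeout instead of going fully offline:
# os.environ["HF_HUB_ETAG_TIMEOUT"] = "60"
```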
-
On `PP + FSDP` and `PP + TP + FSDP`:
- Is there any documentation on how these different parallelisms compose? (A rough sketch follows this list.)
- What are the largest training runs these strategies have been tested on?
- Are there…
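Not documentation, but for concreteness: recent PyTorch composes these parallelisms over an N-D device mesh, with each strategy consuming one mesh dimension. A sketch with mesh sizes invented for illustration (2 x 4 x 2 = 16 GPUs):
```python
# Hypothetical 3-D mesh layout for PP + FSDP + TP; the dimension names are
# a common convention, not a fixed API requirement.
from torch.distributed.device_mesh import init_device_mesh

mesh = init_device_mesh(
    "cuda",
    (2, 4, 2),  # pipeline stages x FSDP shards x tensor-parallel ranks
    mesh_dim_names=("pp", "dp_shard", "tp"),
)

pp_mesh = mesh["pp"]          # pipeline stages placed along this dim
fsdp_mesh = mesh["dp_shard"]  # FSDP shards parameters along this dim
tp_mesh = mesh["tp"]          # tensor parallelism splits layers along this dim
```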
-
### System Info
```Shell
Latest main version, torch nightly, cuda 12.6
```
### Information
- [ ] The official example scripts
- [ ] My own modified scripts
### Tasks
- [ ] One of t…
-
Hey, thanks for the great project. Very excited about using it.
When doing the post-install I noticed that some internal torch distributed code seems to be patched and I was wondering what was the …
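One way to investigate this locally while waiting for an answer: check where a suspect function's code actually lives. A small sketch (which function is patched is an assumption here; `all_gather` is just an example):
```python
# If a function has been monkey-patched, its source file usually points
# outside the installed torch tree.
import inspect
import torch.distributed as dist

fn = inspect.unwrap(dist.all_gather)   # strip decorator wrappers, if any
print(fn.__module__)                   # module the function reports
print(inspect.getsourcefile(fn))       # file the code actually lives in
```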
-
Hi, I'm wondering how I should be thinking of the mixed precision policies of these three packages together. My plugin is below. It works, but I don't think we're doing things right with the mixed_pre…
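The plugin itself is cut off above, but one common arrangement is to give FSDP an explicit `MixedPrecision` policy through Accelerate and keep the `Accelerator`'s own `mixed_precision` setting consistent with it. A sketch, not the poster's actual config:
```python
# Run under `accelerate launch` with FSDP enabled; the dtypes are illustrative.
import torch
from accelerate import Accelerator, FullyShardedDataParallelPlugin
from torch.distributed.fsdp import MixedPrecision

fsdp_plugin = FullyShardedDataParallelPlugin(
    mixed_precision_policy=MixedPrecision(
        param_dtype=torch.bfloat16,   # compute dtype for gathered params
        reduce_dtype=torch.bfloat16,  # dtype for gradient reduction
        buffer_dtype=torch.bfloat16,  # dtype for module buffers
    ),
)
accelerator = Accelerator(mixed_precision="bf16", fsdp_plugin=fsdp_plugin)
```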
-
Hi,
Does the Shampoo implementation support HuggingFace's Accelerate library?
Can it be used in:
`model, optimizer, scheduler = accelerator.prepare(model, optimizer, scheduler)` ?
Thanks!
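Nothing above confirms support either way, but since `DistributedShampoo` subclasses `torch.optim.Optimizer`, `accelerator.prepare` will at least accept it. A smoke-test sketch (constructor arguments follow Meta's optimizers README from memory and may differ by version):
```python
import torch
from accelerate import Accelerator
from distributed_shampoo import AdamGraftingConfig, DistributedShampoo

model = torch.nn.Linear(512, 512)
optimizer = DistributedShampoo(
    model.parameters(),
    lr=1e-3,
    betas=(0.9, 0.999),
    epsilon=1e-12,
    max_preconditioner_dim=8192,
    precondition_frequency=100,
    grafting_config=AdamGraftingConfig(beta2=0.999, epsilon=1e-8),
)

accelerator = Accelerator()
# prepare() wraps any torch.optim.Optimizer; whether the preconditioners
# behave correctly under DDP/FSDP is the open question being asked here.
model, optimizer = accelerator.prepare(model, optimizer)
```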
-
Hi all, first of all, thanks for your great work!
I have an issue when trying to use the optimizer with FSDP training.
The error is:
```
optimizer = DistributedShampoo(
  File "/root/slurm/src/opti…
```
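The traceback is cut off, so this is only a guess: Shampoo's FSDP path needs per-parameter metadata compiled from the already-wrapped model. A sketch following Meta's optimizers README (import paths from memory and may differ by version):
```python
# Assumes init_process_group has already run (e.g. under torchrun).
import torch
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP
from distributed_shampoo import (
    DistributedShampoo,
    FSDPShampooConfig,
    compile_fsdp_parameter_metadata,
)

model = FSDP(torch.nn.Linear(512, 512).cuda())  # placeholder module

# FSDP flattens parameters, so Shampoo needs metadata from the wrapped
# model to recover per-parameter shapes for its preconditioners.
optimizer = DistributedShampoo(
    model.parameters(),
    lr=1e-3,
    distributed_config=FSDPShampooConfig(
        param_to_metadata=compile_fsdp_parameter_metadata(model),
    ),
)
```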
-
### Reminder
- [x] I have read the README and searched the existing issues.
### Reproduction
Is LLaMA-Factory capable of the FSDP QDoRA method described here:
https://www.answer.ai/posts/2024-04-26-fsdp-qdor…
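Whether LLaMA-Factory wires this up is not answered above; as I read the answer.ai recipe, the underlying ingredients are 4-bit quantization with FSDP-shardable quant storage plus DoRA-enabled LoRA adapters. A sketch using transformers/peft directly (not LLaMA-Factory's config; the model id is a placeholder):
```python
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

bnb = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
    bnb_4bit_quant_storage=torch.bfloat16,  # lets FSDP shard the quantized weights
)
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Meta-Llama-3-8B",
    quantization_config=bnb,
    torch_dtype=torch.bfloat16,
)

# use_dora=True upgrades the LoRA adapters to DoRA (weight-decomposed LoRA).
model = get_peft_model(model, LoraConfig(r=16, lora_alpha=32, use_dora=True))
```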
-
### 🚀 The feature, motivation and pitch
Fine-tuning with FSDP alone works well, and sharded checkpoints are saved as `__0_*.distcp`, `.metadata`, and `train_params.yaml`. I can see the loss drop reas…
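The `.distcp` files are torch distributed-checkpoint shards; assuming that format, they can be consolidated into a single `torch.save`-style file with a converter that ships in PyTorch >= 2.2 (the paths below are placeholders):
```python
from torch.distributed.checkpoint.format_utils import dcp_to_torch_save

# Reads the sharded __0_*.distcp / .metadata directory, writes one .pt file.
dcp_to_torch_save("path/to/checkpoint_dir", "consolidated_model.pt")
```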