-
I am attempting Multi-node training of Falcon 7B with Peft using DeepSpeed and Accelerate. During single node training, it takes up 39GB of GPU memory. However, in multi-node training, both machines c…
-
Below are some log messages in the log.txt.
patching file /Users/helen/ros_for_ios/log4cxx/apr-1.4.6/include/apr_general.h
patching file /Users/helen/ros_for_ios/log4cxx/apr-1.4.6/include/apr.h
patchi…
-
### Reminder
- [X] I have read the README and searched the existing issues.
### Reproduction
训练命令如下,其中参考了issue#1683的方法加上export NCCL_P2P_LEVEL=NVL也没用
#!/bin/bash
module load anaconda cudnn/8.6.0.…
-
Ich verwende Docker Desktop sowie Google Chrome und komme bei der Installation von Woltlab nicht weiter.
Aufgerufen habe ich: http://localhost:8080/install.php
folgende Punkte verhindern die Ins…
ps915 updated
4 months ago
-
A of the major issues with try/catch is that it's possible to catch the wrong error and _think_ that you are handling a mundane, expected problem, when in fact something far worse is wrong and the pro…
-
There will be times where I run all the blocks as usual without changing anything, and the WebUI block won't create a Gradio link. It would just say something along the lines of this:
Loading wei…
-
Hi when running accelerate launch with FSDP I run into the following error:
```
return torch.embedding(weight, input, padding_idx, scale_grad_by_freq, sparse)
RuntimeError: 'weight' must be 2-D
…
-
Started full-time thesis around april/may 2023.
Track DST, Q3/4 start. Still "seminar course" ToDo. Has superapp/MusicDAO experience. Discussed as diverse as digital Euro and Web3 search engine (un…
-
### System Info
- `transformers` version: 4.30.2
- Platform: Linux-5.19.0-46-generic-x86_64-with-glibc2.35
- Python version: 3.10.11
- Huggingface_hub version: 0.15.1
- Safetensors version: 0.3…
-
## 背景
飞桨官网的 API 文档,有中文和英文两个版本,都存储于 GitHub 中,并通过技术手段转为 HTML 文件后呈现在官网。
* `中文 API 文档`存储于 docs 仓库下,具体路径为 [PaddlePaddle/docs/docs/api/paddle](https://github.com/PaddlePaddle/docs/tree/develop/docs/api/padd…