-
### Bug description
Using DeepSpeed ZeRO Stage 2 with certain models fails to properly save and reload the model checkpoint after conversion to the Lightning format.
In the provided example, several …
-
Dataloader name: `filipino_slang_norm/filipino_slang_norm.py`
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?filipino_slang_norm
| Dataset| filipino_slang_norm |
|---------…
-
## I rewrote **EVA/EVA-master-project/EVA-02/det/tools/deploy/export_model.py** to use LazyConfig to read **EVA/EVA-master-project/EVA-02/det/projects/ViTDet/configs/eva2_o365_to_coco/eva2_o365_to_co…
-
### Description
This group defined a number of [projects/goals for 2023](https://github.com/nodejs/package-maintenance/issues/550) & one of them was to create a new "statusboard" (similar to npm's …
-
### Software environment
```Markdown
- paddlepaddle: N/A
- paddlepaddle-gpu: 2.5.1
- paddlenlp: 68bb39d
```
### Duplicate issues
- [X] I have searched the existing issues
### Error description
```Markdown
llama-13b, single node with 8 A100 GPUs, mp=8 training, enabling a…
-
Very impressed with the all-new, innovative architecture in DETR!
Can you clarify your recommendations for training on a custom dataset?
Should we build a model similar to the demo and train it, or is it better to use…
-
### Describe the Bug
![image](https://github.com/PaddlePaddle/Paddle/assets/32234672/92749744-e03a-498f-bb5d-42ca0819d891)
```shell
set -x
SCRIPT_HOME=$(cd $(dirname $0); pwd)
CARDS="0,1,2,3…
-
2024-04-07 18:06:31,686 INFO: Loading BasicPBC model from ckpt/basicpbc.pth, with param key: [params_ema].
2024-04-07 18:06:31,771 INFO: Model [PBCModel] is created.
2024-04-07 18:06:31,771 INFO: Te…
-
Hi, I have downloaded the Mp3D dataset and I am trying to run the train.sh file on a single GPU with a small batch size of 2 for debugging.
This is my train.sh:
```
python train.py --batch-size 2 --f…
-
### System Info
- `transformers` version: 4.36.2
- Platform: Linux-4.18.0-147.mt20200626.413.el8_1.x86_64-x86_64-with-glibc2.17
- Python version: 3.8.18
- Huggingface_hub version: 0.20.3
- Safete…