bmtrain Search Results - Githubissues

153 results
for bmtrain

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

OpenBMB/BMCook #23

ImportError: /home/miniconda3/envs/BMCook/lib/python3.10/sit…

Hi, I encountered the error described in the title of this issue, while trying to run the gpt-2 example. Here is my command: ``` export CUDA_VISIBLE_DEVICES=7 torchrun --nnodes=1 --nproc_per_node=1…

wln20 updated 1 year ago
3
OpenBMB/CPM-Bee #39

请教，win10如何避免nccl编译问题

报错如下： nccl.obj : error LNK2001: 无法解析的外部符号 ncclCommInitRank 。。。 build\lib.win-amd64-cpython-39\bmtrain\nccl\_C.cp39-win_amd64.pyd : fatal error LNK1120: 15 个无法解析的外部命令。感谢。

Ren2018 updated 1 year ago
12
OpenBMB/BMTrain #193

[Feature] performance problem

### Is your feature request related to a problem? Please describe. 非常赞赏学长们的工作！我有一个小小的问题注意到readme里有一个吞吐和显存占用的表格。BMtrain显著优于Deepspeed- megaton，我好奇这其中的优化主要来源于什么地方呢。同样的逻辑，为什么我们能够支持更多的bach size，吞吐更高？是否也有显…

Xiang-cd updated 3 months ago
1
bytedeco/javacpp-presets #1395

[pytorch] how to Distributed train the model use javacpp…

Hi, now for the big model ,we need train model use many dirstribute machine, so in python version we could use distribute assert to declear train model in many machine ,but now in javacpp pytorch,…

mullerhai updated 3 months ago
6
OpenBMB/CPM-Bee #79

CPM Bee 微调时设置 half 出现 CUDA 报错，不设置 half 则 assert 报错

CPM 使用微调脚本训练，不开启 --use-delta 这一选项，则出现如下错误： Traceback (most recent call last): File "finetune_cpm_bee.py", line 503, in main() File "finetune_cpm_bee.py", line 499, in main finetune(…

YingLaiLin updated 1 year ago
3
OpenBMB/BMCook #27

int8量化感知训练，保存的模型依然是fp32

通过BMCook进行模型压缩，配置了quantization和distillation，训练的loss收敛的很好。但是保存模型的时候，发现保存的checkpoint文件并没有减少，分析发现线性层的参数还是fp32的。另外bmtrain的优化器AdamOffloadOptimizer和AdamOptimizer也只支持参数保存为fp32和fp16，并没有实现参数保存为int8。

jinmin527 updated 10 months ago
3
OpenBMB/CPM-Bee #83

预训练数据格式

运行pretrain_cpm_bee.sh脚本修改了dataset指定datasets.json ``` json [ { "dataset_name": "pretrain", "task_name": "mlm", "weight": 1.0, "path": "/home/litao/ScienGU…

ScienGU updated 11 months ago
4
OpenBMB/BMCook #21

TypeErrorTypeError: : AdamOptimizer.__init__() got an unexpe…

When I try to run the examples :`bash gpt/gpt2_test.sh` It fails and throws out the following errors: > File "/workspace/BMCook/examples/gpt/gpt2_test.py", line 84, in main …

Oran-Ac updated 1 year ago
1
OpenBMB/CPM-Bee #62

BMTrain不好安装能出一个具体的环境要求吗 ?

尝试了很多版本也不知道哪里出问题了反正就是安装不上有的时候提示torch没有有的时候有提示gcc 重新安装了数次我的是在dock环境里面折腾了2天了环境还没有配好万分感谢啦

xiaoguaishoubaobao updated 1 year ago
8
OpenBMB/CPM-Bee #75

微调时使用bmtrain加载模型报错 Error(s) in loading state_dict for CPMBee

使用finetune_cpm_bee微调时，基础模型加载不了 ### finetune_cpm_bee.sh中的参数如下： OPTS+=" --use-delta" OPTS+=" --model-config config/cpm-bee-1b.json" ... OPTS+=" --load cpm-bee-1b/pytorch_model.bin" ### 报错信息如下 …

baisuzi updated 1 year ago
1

上一页 1...1 2 3 4 5 6 7...16 下一页

153 results for bmtrain

153 results
for bmtrain