bmtrain Search Results - Githubissues

153 results
for bmtrain


FlagAI-Open/FlagAI #246

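Is there a download link for the model? [Question]

**Describe the question** A clear and concise description of what the question is. Hello, is there a download link for the model? Downloading through the code is slow. Is there a model download link on Baidu Cloud Drive or a similar service? Thanks~ **Additional context** Add any other context about the question he…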

lmolhw5252 updated 1 year ago
5
OpenBMB/BMTrain #62

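If one machine does not have enough GPU memory to load the model, will it be loaded onto other machines?

Hello, my question is as follows: with 4 machines, each with two 2080 Ti (11 GB) GPUs, if the model is too large for a single machine to load, will it be loaded onto the other machines through model parallelism? If so, does each node run the following command in turn to carry out training? torchrun --nnodes=4 --nproc_per_node=2 --rdzv_id=1 --rdzv_backend=c10d --rdzv_endpoint=xxx.xxx.xxx.xx:88688} …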

bucm-tcm-tool updated 1 year ago
3
thunlp/OpenDelta #39

does opendelta support gradient_checkpointing?

Thank you for the awesome work. I met some problems when using opendelta with gradient_checkpointing, it just throws: "RuntimeError: element 0 of tensors does not require grad and does not have a g…

hmzo updated 1 year ago
3
FlagAI-Open/FlagAI #334

[Question]: aquila-7B OOM

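### Description Running the aquila-7B inference example code on a 32G GPU reports out of memory. How much GPU memory is required? Other 7B large models run fine; does the aquila model consume more GPU memory? ### Alternatives _No response_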

calla212 updated 1 year ago
28
OpenBMB/BMTrain #50

Is there a Docker image that I can use to run bmtrain directly?

The installation fails with "RuntimeError: Error compiling objects for extension". Thank you!

drxmy updated 1 year ago
9
OpenBMB/ModelCenter #33

[BUG] Pretrained GPT2 model has an incorrect size compared w…

**Describe the bug** ``` File "example.py", line 303, in main() File "example.py", line 93, in main gpt = GPT2.from_pretrained("gpt2-base", config=gpt_config) File "/home/chenya…

alphaGem updated 2 years ago
5
fly51fly/aicoco #3

Teacher Aikeke's 24-hour trending shares

Selected Weibo content

fly51fly updated 4 months ago
1907
OpenBMB/BMTrain #31

Comparison with Google Jax & Nvidia Apex

Can BMTrain be used together with tools like Jax or Apex, or are there any comparisons or experiments planned with these tools? Thanks

Kunlun-Zhu updated 2 years ago
1
OpenBMB/CPM-Live #254

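After tuning cpm ant++, inference does not produce the desired results

After tuning cpm ant++ on several hundred thousand samples, I obtained a best.pt file, but at inference time the corresponding input does not produce the desired results. The output is all English characters and symbols like ---------. I don't know which step went wrong.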

touwenameng updated 1 year ago
23
OpenBMB/BMTrain #27

About DistributedDataloader

Hi, is a DistributedDataloader design necessary to work with bmtrain for acceleration, or does bmtrain itself handle the optimization of both memory and speed?

Kunlun-Zhu updated 2 years ago
10
