-
**Describe the question**
A clear and concise description of what the question is.
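Hello, is there a direct download link for the model? Downloading through the code is slow. Is there a Baidu Netdisk link or a similar mirror for the model? Thanks~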
**Additional context**
Add any other context about the question he…
-
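Hello, my question is as follows:
I have 4 machines, each with two 2080 Ti GPUs (11 GB each). If the model is too large to load on a single machine, will it be spread across the other machines through model parallelism?
If so, should each node run the following command to start training?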
torchrun --nnodes=4 --nproc_per_node=2 --rdzv_id=1 --rdzv_backend=c10d --rdzv_endpoint=xxx.xxx.xxx.xx:88688} …
-
Thank you for the awesome work.
I ran into a problem when using opendelta with gradient_checkpointing; it throws:
"RuntimeError: element 0 of tensors does not require grad and does not have a g…
-
### Description
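Running the aquila-7B inference example on a 32 GB GPU fails with an out-of-memory error. How much GPU memory does it need?
Other 7B models run fine on the same card. Does the Aquila model use more GPU memory than they do?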
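
For context, a rough back-of-the-envelope estimate of the memory needed for the weights alone may help narrow this down. This is only a sketch: it assumes "7B" means exactly 7 billion parameters, it ignores activations, the KV cache, and framework overhead, and `weight_memory_gib` is a hypothetical helper, not part of any library. Whether the example script loads the weights in fp32 or fp16 is also an assumption to check.

```python
def weight_memory_gib(num_params: float, bytes_per_param: int) -> float:
    """Memory needed just to hold the model weights, in GiB."""
    return num_params * bytes_per_param / 1024**3


# Assumption: "7B" is taken as exactly 7 billion parameters.
NUM_PARAMS = 7e9

# Bytes per parameter for common storage dtypes.
DTYPES = [("fp32", 4), ("fp16/bf16", 2), ("int8", 1)]

for name, nbytes in DTYPES:
    gib = weight_memory_gib(NUM_PARAMS, nbytes)
    print(f"{name:>9}: {gib:.1f} GiB for weights only")
```

Under these assumptions, fp32 weights alone take roughly 26 GiB, which leaves little headroom on a 32 GB card, while fp16 weights take about half of that. If the other 7B models were loaded in half precision and this example loads in full precision, that difference could account for the error.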
### Alternatives
_No response_
-
The installation fails with "RuntimeError: Error compiling objects for extension".
Thank you!
-
**Describe the bug**
```
  File "example.py", line 303, in <module>
    main()
  File "example.py", line 93, in main
    gpt = GPT2.from_pretrained("gpt2-base", config=gpt_config)
File "/home/chenya…
-
Selected Weibo content
-
Can BMTrain be used together with tools like JAX or Apex? Are any comparisons or experiments with these tools planned? Thanks
-
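After fine-tuning cpm ant++ on several hundred thousand samples I obtained a best.pt file, but at inference time the corresponding input does not produce the expected result: the output is all English characters and symbols like ---------. I can't tell which step went wrong.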
-
Hi, is a DistributedDataloader design necessary to get the speedup from bmtrain, or does bmtrain itself handle the optimization of both memory and speed?