-
8-win_amd64.exp
nccl.obj : error LNK2001: 无法解析的外部符号 ncclCommInitRank
nccl.obj : error LNK2001: 无法解析的外部符号 ncclReduce
nccl.obj : error LNK2001: 无法解析的外部符号 ncclRecv
nccl.obj : …
-
when running text_generation, bmtrain error appeared:
`ImportError: /home/CPM-Bee/BMTrain/bmtrain/optim/_cuda.cpython-39-x86_64-linux-gnu.so: undefined symbol: _ZN2at4_ops9new_zeros4callERKNS_6Tensor…
-
Traceback (most recent call last):
File "finetune_cpm_bee.py", line 4, in
import bmtrain as bmt
File "/home/mdisk2/tanjunwen/anaconda3/envs/cpmbee2/lib/python3.8/site-packages/bmtrain/__in…
-
### Description
[INFO] bmtrain_mgpu.sh: hostfile configfile model_name exp_name exp_version
bmtrain_mgpu.sh: line 35: ifconfig: command not found
/home/edcuser/.conda/envs/pytorch_cuda117/lib/pytho…
-
def get_model_tokenizer(args):
model = LlamaForCausalLM.from_pretrained(args.model_name_or_path)
tokenizer = LlamaTokenizer.from_pretrained(args.model_name_or_path)
tokenizer.add_specia…
-
执行命令:
python setup.py install
报错信息:
The detected CUDA version (12.1) mismatches the version that was used to compile
PyTorch (11.8). Please make sure to use the same CUDA versions.
疑问:
目前p…
-
### Is your feature request related to a problem? Please describe.
from transformers import BloomConfig,BloomModel
import bmtrain as bmt
config = BloomConfig.from_pretrained('bloom/config.json')
…
-
finetune_cpm_bee.py迭代两步后,服务器内存(不是显存)占用急剧增加,直到占满报错,问题出在更新参数的时候:
File "/home/adax/projects/WebCPM/training/scripts/../finetune_cpm_bee.py", line 210, in finetune
optim_manager.step()
File "/h…
-
下面是报错信息
![image](https://github.com/OpenBMB/CPM-Bee/assets/21216881/ccbb767c-8728-4887-aae2-037d9b7ca126)
****
`Collecting torch=1.10
Downloading torch-1.13.1-cp39-cp39-manylinux1_x86_64.whl (…
-
python setup.py install
running install
running bdist_egg
running egg_info
writing bmtrain.egg-info\PKG-INFO
writing dependency_links to bmtrain.egg-info\dependency_links.txt
writing requirement…