bmtrain Search Results - Githubissues

153 results
for bmtrain


FlagAI-Open/FlagAI #371

[FAQ] How to use Aquila? How to use the Aquila series models through FlagAI

Aquila2 7B and 34B repository: https://github.com/FlagAI-Open/Aquila2 ## 1. Installation and deployment ### Q: Will the model weights continue to be updated? Yes, please check the changelog: [Chinese](https://github.com/FlagAI-Open/FlagAI/blob/master/examples/Aquila/changelog_zh.md) / …

siyu-hu updated 10 months ago
11
OpenBMB/CPM-Bee #73

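Fine-tuning fails with CUDA error: CUBLAS_STATUS_NOT_INITIALIZED when calling `…

With --max-length 64, the run reports no error, but the results directory contains no output either; with --max-length 128, an error is raised. Single machine with one 3080 Ti GPU. Machine environment: ``` torch 1.13.1+cu117 bmtrain 0.2.1 NVIDIA-SMI 515.105.01 Driver …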

c2j updated 1 year ago
1
OpenBMB/MiniCPM-V #320

finetune on NPUs

### Is there an existing issue / discussion for this? - [X] I have searched the existing issues / discussions ### Is there an existing ans…

EasonXiao-888 updated 1 month ago
3
OpenBMB/ModelCenter #41

[BUG] llama outputting random gibberish

**Describe the bug** I used a verified LLaMA 7B hg checkpoint, and used a single thread bmb to do inference. But the output is just random gibberish. Not sure why. **Minimal steps to reproduce…

w32zhong updated 1 year ago
1
demerzel-iv/bmtrain_mindspore #5

About StubTensor

What is the difference between a StubTensor and an ordinary Tensor?

demerzel-iv updated 2 months ago
1
pypi/support #2933

Project Limit Request:BMTrain - 20 GB

### Project URL https://test.pypi.org/project/bmtrain/0.2.2/ ### Does this project already exist? - [X] Yes ### New limit 20 ### Update issue title - [X] I have updated the title.…

MayDomine updated 2 months ago
5
demerzel-iv/bmtrain_mindspore #3

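Dynamic memory allocation issue

Temporary workaround: set the following environment variables (these variables will change in the official release): MS_DEV_ENABLE_ASCEND_VMM=1 MS_DEV_ASCEND_VMM_ALIGN_SIZE="2MB" Limitations: 1. The first step may take longer 2. GE is not supported (not relevant to dynamic graph mode) 3. Defragmentation may affect execution performance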

lvyufeng updated 2 months ago
3
OpenBMB/Tell_Me_More #1

TypeError: init_distributed() got an unexpected keyword argu…

I am trying to reproduce the same result. I am using a server with 8 GPUs, 32 GB each. When I try to run the sft.sh script, I get this error. File "/Mistral-Interact/src/sft.py", line 163, in initialize…

MohamedAdelNaguib updated 5 months ago
2
Raincleared-Song/sparse_gpu_operator #7

Is the training code available?

Also, where can the Vanilla ReLU and Shifted ReLU mentioned in the paper be obtained? I would appreciate an answer.

lvlu911 updated 5 months ago
4
huggingface/transformers #31371

TypeError: Block.forward() got an unexpected keyword argumen…

### System Info Copy-and-paste the text below in your GitHub issue and FILL OUT the two last points. - `transformers` version: 4.41.2 - Platform: Linux-6.5.0-26-generic-x86_64-with-glibc2.35 - P…

kunling-cxk updated 1 month ago
4
