-
Aquila2 7B and 34B repository: https://github.com/FlagAI-Open/Aquila2
## 1. Installation and Deployment
### Q: Will the model weights continue to be updated?
Yes. Please check the changelog: [Chinese](https://github.com/FlagAI-Open/FlagAI/blob/master/examples/Aquila/changelog_zh.md) / …
-
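With `--max-length 64`, the run completes without errors, but nothing is written to the results directory. With `--max-length 128`, an error is raised.
Single machine with one 3080 Ti GPU. Environment: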
```
torch 1.13.1+cu117
bmtrain 0.2.1
NVIDIA-SMI 515.105.01 Driver …
-
### Is there an existing issue / discussion for this?
- [X] I have searched the existing issues / discussions
### Is this question answered in the FAQ? | Is there an existing ans…
-
**Describe the bug**
I used a verified LLaMA 7B hg checkpoint and ran inference with single-thread bmb.
However, the output is just random gibberish, and I am not sure why.
**Minimal steps to reproduce…
-
What is the difference between a StubTensor and a regular Tensor?
-
### Project URL
https://test.pypi.org/project/bmtrain/0.2.2/
### Does this project already exist?
- [X] Yes
### New limit
20
### Update issue title
- [X] I have updated the title.…
-
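Temporary workaround:
Set the following environment variables (their names will change in the official release):
MS_DEV_ENABLE_ASCEND_VMM=1
MS_DEV_ASCEND_VMM_ALIGN_SIZE="2MB"
Limitations:
1. The first step may take longer.
2. GE is not supported (dynamic graph mode is not affected).
3. Defragmentation may affect execution performance.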
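For anyone launching from a Python script instead of exporting the variables in the shell, the workaround above could be applied roughly as follows. This is a minimal sketch, not an official recipe: the helper name `apply_vmm_workaround` is hypothetical, and it assumes the variables need to be in the process environment before the framework is imported and initialized.

```python
import os

# Workaround variables quoted above. Their names are expected to change
# in the official release, so keep them in one place.
VMM_WORKAROUND_ENV = {
    "MS_DEV_ENABLE_ASCEND_VMM": "1",
    "MS_DEV_ASCEND_VMM_ALIGN_SIZE": "2MB",
}


def apply_vmm_workaround(env=os.environ):
    """Set the workaround variables, keeping any value that is already set.

    `env` is any mutable mapping; it defaults to the process environment.
    Returns the effective values of the workaround variables.
    """
    for name, value in VMM_WORKAROUND_ENV.items():
        env.setdefault(name, value)
    return {name: env[name] for name in VMM_WORKAROUND_ENV}


if __name__ == "__main__":
    print(apply_vmm_workaround())
    # Assumption: import and initialize the framework only after this point,
    # so that it sees the variables when it starts up.
```

Using `setdefault` means a value already exported in the shell takes precedence over the defaults in the script.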
-
I am trying to reproduce the same result. I am using a server with 8 GPUs, 32 GB each. When I run the sft.sh script, I get this error:
File "/Mistral-Interact/src/sft.py", line 163, in initialize…
-
Also, where can the Vanilla ReLU and Shifted ReLU mentioned in the paper be obtained? I would appreciate an answer.
-
### System Info
Copy-and-paste the text below in your GitHub issue and FILL OUT the two last points.
- `transformers` version: 4.41.2
- Platform: Linux-6.5.0-26-generic-x86_64-with-glibc2.35
- P…