-
mindnlp上有您的bge-m3的适配。如果您了解适配过程,请问pytorch的模型参数是如何转化为mindspore版本的?
-
在微调baichuan2-7B-Base模型的时候,发现,输入的token长度不能超过512,其官方给出的最大长度为4096。
微调遵循官方教程,使用lora的方式,在微调过程中使用官方数据集,在数据中添加超长数据时出现
Token indices sequence length is longer than the specified maximum sequence length for …
-
**Is your feature request related to a problem? Please describe.**
A clear and concise description of what the problem is. Ex. I'm always frustrated when [...]
I want to integrate paddlepaddle a…
-
### 🐛 Describe the bug
When I set the operator weights all to 1, the operator weights change as soon as an input is made to the operator.Before and after taking input to the operator, we output its…
-
## Environment
### Hardware Environment(`Ascend`/`GPU`/`CPU`):
> /device cpu
### Software Environment:
- **MindSpore version (source or binary)**: 1.5.2
- **Python version (e.g., Python 3.…
-
感谢你做的出色的工作 我在使用代码进行训练的时候发生了梯度爆炸的问题 这是我的日志!
epoch: 1 step: 1, loss is 9.914839744567871
epoch: 1 step: 2, loss is 11.918198585510254
epoch: 1 step: 3, loss is 10.370205350685865
epoch: 1 step: 4, l…
-
运行demo/predict.py 报错
self._context_handle.set_param(param, value)
RuntimeError: Unsupported device target Ascend. This process only supports one of the ['CPU']. Please check whether the Asce…
-
我运行了`sh examples/pangu_draw_v3/run_sampling.sh`,不能导入https://github.com/mindspore-lab/mindone/blob/3ef47e4176b193845c628fd3059cae2d4cff6559/examples/pangu_draw_v3/gm/modules/attention.py#L18 的FlashAtte…
-
## Environment
### Hardware Environment(`Ascend`/`GPU`/`CPU`): Ascend910A atlas800-9000 风冷
- **MindSpore version (source or binary)**: 2.1
- **Python version (e.g., Python 3.7.5)**: 3.8
- **…
-
## Environment
### Hardware Environment(`Ascend`/`GPU`/`CPU`):
`/device cpu`
### Software Environment:
- **MindSpore version**: `2.2.14` (`r2.2` branch)
- **Python version**: `3.…