-
Hello,
After training, and after predicting the dataset images, I got IndexError. My dataset including 12 training and 3 test images and labels. The error message has been produced after 12nd imag…
-
Time Series Classification is a very popular machine learning problem.
You can find a full survey and empirical study ([link to paper](https://link.springer.com/article/10.1007/s10618-016-0483-9)) o…
-
Hello, thank you for sharing the source code. While trying to reproduce **SST2 task result with RoBERTa-base model**, I've encountered some questions regarding the hyper-parameters, lora_alpha, and a …
-
We need to research on what sort of analytics we can do in terms of tagging context, credibility of a source etc.
-
Hi,
Great work. Thanks for building this library. I am working on a life-long learning problem that tends to have a large number of data points, and thus a large kernel matrix.
It appears tha…
-
我的训练集数据量很大,有上百万,直接读取训练会OOM,所以使用streaming模式读取数据,但是发现训练速度很慢。
发现gpu的利用率很低
cpu直接被打满了
训练参数
```
SftArguments(train_type='sft', model_type='internvl2-8b', model_revision='master', full_deter…
-
## 🚀 Feature
torch.scatter_add will distribute values over an output tensor, summing if multiple values land in the same destination coordinate.
torch.logsumexp performs addition in linear space o…
fuzic updated
2 months ago
-
### Please check that this issue hasn't been reported before.
- [X] I searched previous [Bug Reports](https://github.com/OpenAccess-AI-Collective/axolotl/labels/bug) didn't find any similar reports…
-
## 环境准备 (Environmental Preparation)
```bash
# 安装ms-swift (Install ms-swift)
pip install git+https://github.com/modelscope/swift.git#egg=ms-swift[llm]
# 安装最新的transformers(Install the latest trans…
-
**Aim**
Find out what self-attention actually does (ie. benefits, limitations) and what research is already out there.
**Plan**
- [x] [Low-Rank and Locality Constrained Self-Attention for Sequence Mo…