-
Hello,
I've been trying to qwen2 0.5B and tinyclip using the repository, but I'm running into CUDA OOM issues on the dense2dense distillation step. Im running on 4 80GB A100s, I was wondering if I …
-
# URL
- https://arxiv.org/abs/2402.17764
# Affiliations
- Shuming Ma, N/A
- Hongyu Wang, N/A
- Lingxiao Ma, N/A
- Lei Wang, N/A
- Wenhui Wang, N/A
- Shaohan Huang, N/A
- Li Dong, N/A
-…
-
### 是否已有关于该错误的issue或讨论? | Is there an existing issue / discussion for this?
- [X] 我已经搜索过已有的issues和讨论 | I have searched the existing issues / discussions
### 该问题是否在FAQ中有解答? | Is there an existing ans…
-
How similar/dissimilar is this to LMTuner [[GitHub Repo](https://github.com/WENGSYX/LMTuner)] [[Paper](https://arxiv.org/abs/2308.10252)]?
-
Hello, great work on your project!
I'm trying to generate lyrics from a melody using the provided example in the Jupyter notebook. However, I'm struggling to get the lyrics in English, even when I …
-
# Architecture
This document outlines the architecture of the AI Nutrition-Pro application, including system context, containers, and deployment views. The architecture is depicted using C4 diagram…
-
# 平台(如果交叉编译请再附上交叉编译目标平台):
RK3588,本地编译
gcc版本11.4.0
g++版本
Ubuntu 22.04
# Github版本:
git clone https://github.com/alibaba/MNN.git
# 编译方式:
cmake..
make -j4
python build_deps.py opencl
py…
-
Hello,
Tensor assertion error is raised if you try to train the model. It starts with the following:
```bash
0%| | 0/10 [00:00
-
By limited by VRAM, I'm using unsloth to finetuned Qwen2 by following the notebook(https://colab.research.google.com/drive/1mvwsIQWDs2EdZxZQF9pRGnnOvE86MVvR?usp=sharing).
But I got these warnings f…
-
Thank you so much for your work. I'm trying to run `sh ./dbgpt_hub/scripts/export_merge.sh`, but getting the following error. Can you upload the latest script? Thanks.
```bash
Traceback (most recent…