-
I'm running Unsloth to fine tune LORA the Instruct model on llama3-8b .
1: I merge the model with the LORA adapter into safetensors
2: Running inference in python both with the merged model direct…
-
## 背景
众所周知,Paddle 是一个历史悠久的框架,使得 Paddle 能够久经考验,应对各种场景稳定运行。但历史的沉淀同样带来一个严重的问题,就是框架内 API 语义不清晰,多种 API 能够做同样或者类似的事情。得益于我们的公开 API 审查机制和 fluid 清理,公开 API 中类似问题较少,但框架内部仍存在大量历史遗留的内部 API 的使用,这些 API 的存在导致框架内部需…
-
I test the shared checkpoints without any tuning and get the following results. I am confused to see that pretrained-base performs better than finetuned-base on Slake. Also, as I set different seeds…
-
### Required prerequisites
- [X] I have read the documentation .
- [X] I have searched the [Issue Tracker](https://github.com/PKU-Alignment/omnisafe/issues) and [Discussions](https://github.com/PKU-A…
-
### Describe the bug
With the version of speechbrain being 1.0.0, the unchanged source code and the Mini LibriSpeech dataset, I worked according to the Google Colab (https://colab.research.google.c…
-
I'm using dreamer-v3 and it works pretty well at first. But when I do the following thing to the env:
- Increase the reward scale
- Increase the sequence length of the agent
- Increase the diffic…
-
Hi,
I'm trying to train your model with a dataset that only contains one integer variable (1 col x 22928 rows), but after the training (the model is saved) I obtain the following error:
```
Tra…
-
https://github.com/modelscope/swift/blob/main/docs/source_en/Multi-Modal/glm4v-best-practice.md
使用的是文档中的命令,eval_acc一直没有变化
# Experimental environment: A100
# 40GB GPU memory
CUDA_VISIBLE_DEVICE…
-
```
OS : win11-wsl2 cuda-12.1
GPU : RTX4070ti 12GB
AssertionError: An object named 'GANLoss' was already registered in 'loss' registry!
(cuda121) root@LZH5:/mnt/d/AI/A0J/231220/wsl_cuda121#…
-
I don't think anyone suggested this in the past, and I thought maybe a place to discuss this (and put bounties) would be a good start.
# Problem
The Minetest world is pretty big for the average playe…