-
I pulled the latest docker image.
When i run the /opt/deploy.sh, it shows that the container lacks the "python-multipart" dependency. And it is hard to add this dependency because of the pyenv enviro…
-
哈喽作者大大,想请教下STAR+MLora具体是怎么用MLora替换的STAR的结构,看到仓库里好像没有对应的配置文件,
-
halo作者大大,如果想把您仓库中关于mlora部分的tensorflow代码改成使用pytorch框架的代码,同时使mlora适配其他的backbone models,求指教修改的顺序和思路 :)
-
I encountered an error when using the --trace option. The error message indicates the following:
> /u/.conda/envs/mlora/lib/python3.12/site-packages/bitsandbytes/autograd/_functions.py:322: UserWar…
-
why to commit the part of stop gradient of main backbone?
-
Your work is outstanding. I would like to ask which key code modules are used for the assembly of Lora modules.
Thank you for your reply. @yezhengmao1
-
17:23:50-923757 INFO Kohya_ss GUI version: v24.1.4
17:23:51-726265 INFO Submodule initialized and updated.
17:23:51-726265 INFO nVidia toolkit detected
17:23:53-757607 INFO Torch 2.…
-
Excuse me, can the training process only be implemented through mlora? But that doesn't match my own torch and transformer versions, is there a solution?
-
Is the frame work support multi-gpu training?
I want to use the frame work to train a 70B model, however, I did not find the parameter settings or methods for multi-gpus training.
-
模型能读取到,但不能初始化ChatGLM模型中一些LORA (Low-Rank Adaptation)适配器的权重。请问该怎么解决