-
### Please check that this issue hasn't been reported before.
- [X] I searched previous [Bug Reports](https://github.com/axolotl-ai-cloud/axolotl/labels/bug) and didn't find any similar reports.
### Exp…
-
As the title says, I'd like to know whether there is any difference in quality between 8-bit and 4-bit QLoRA?
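For reference, the two modes differ only in the bitsandbytes quantization config passed when loading the base model. Below is a minimal, hedged sketch showing both configurations with `transformers`; the base model name and compute dtype are placeholder assumptions, not settings from this project.

```python
# Hedged sketch: the only difference between 8-bit and 4-bit QLoRA loading is the
# BitsAndBytesConfig; model name and compute dtype here are placeholder assumptions.
import torch
from transformers import AutoModelForCausalLM, BitsAndBytesConfig

bnb_8bit = BitsAndBytesConfig(load_in_8bit=True)

bnb_4bit = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",             # NF4 is the QLoRA-paper default
    bnb_4bit_compute_dtype=torch.bfloat16,
    bnb_4bit_use_double_quant=True,
)

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",            # placeholder base model
    quantization_config=bnb_4bit,          # or bnb_8bit
    device_map="auto",
)
```

The QLoRA paper reports that 4-bit NF4 fine-tuning closely matches higher-precision baselines at roughly half the memory of 8-bit, but how noticeable the difference is depends on the task and data.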
-
I used train_qlora.py to fine-tune the LLaMA 2-7B model, and then used get_predict_qlora.sh (with checkpoint 10000) to get the results, but many of the outputs are empty, as shown below:
…
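One way to narrow this down is to reload the checkpoint-10000 adapter with PEFT and generate directly, to see whether the model itself produces empty text or the prediction script is dropping it. This is a hedged sketch; the base model name, adapter path, and prompt below are assumptions, not the repo's actual settings.

```python
# Hedged sketch: reload the QLoRA adapter and generate once; paths and prompt are placeholders.
import torch
from transformers import AutoTokenizer, AutoModelForCausalLM
from peft import PeftModel

base = "meta-llama/Llama-2-7b-hf"        # assumed base model
adapter = "output/checkpoint-10000"      # assumed adapter checkpoint directory

tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(base, torch_dtype=torch.float16, device_map="auto")
model = PeftModel.from_pretrained(model, adapter)

# Use a prompt in the same format as the training data.
inputs = tokenizer("example prompt in the training format", return_tensors="pt").to("cuda")
out = model.generate(**inputs, max_new_tokens=128)
print(tokenizer.decode(out[0][inputs["input_ids"].shape[1]:], skip_special_tokens=True))
```

If this prints non-empty text, the issue is more likely in the prediction script's prompt formatting or post-processing than in the checkpoint itself.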
-
I installed the required packages strictly following the README:
pip install -r requirements.txt
pip install git+https://github.com/unslothai/unsloth.git
pip install bitsandbytes==0.43.1
pip install peft==0.10.0
pip install torch==2.2.2
pip…
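
A quick, hedged way to confirm that the pinned versions actually ended up in the environment (package names are taken from the commands above; the `unsloth` distribution name is an assumption):

```python
# Hedged sketch: print installed versions to compare against the README pins above.
from importlib.metadata import version, PackageNotFoundError

for pkg in ("torch", "peft", "bitsandbytes", "unsloth"):  # "unsloth" dist name assumed
    try:
        print(f"{pkg}: {version(pkg)}")
    except PackageNotFoundError:
        print(f"{pkg}: not installed")
```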
-
### System Info
Latest TRL installed from source; I can't run `trl env` right now because the cluster is shut down, but I'm installing everything from source.
If required, I will restart the cluster and run it.
### Information
- [ ] Th…
-
Hi, thanks for the interesting project!
I created a Gemma 7B-based model, [webbigdata/C3TR-Adapter](https://huggingface.co/webbigdata/C3TR-Adapter).
This model is in Hugging Face Transformers format and …
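For context, here is a hedged sketch of loading that model, assuming it is a PEFT/LoRA adapter on top of a Gemma 7B base (which is not stated above); the dtype and device map are also assumptions.

```python
# Hedged sketch: load webbigdata/C3TR-Adapter as a PEFT adapter; dtype/device are assumptions.
import torch
from transformers import AutoTokenizer
from peft import AutoPeftModelForCausalLM

model = AutoPeftModelForCausalLM.from_pretrained(
    "webbigdata/C3TR-Adapter",
    torch_dtype=torch.bfloat16,
    device_map="auto",
)
tokenizer = AutoTokenizer.from_pretrained("webbigdata/C3TR-Adapter")
```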
-
Traceback (most recent call last):
  File "/root/xx/DB-GPT-Hub/predict_qlora.py", line 233, in <module>
    dataset_name, result = predict()
  File "/root/xx/DB-GPT-Hub/predict_qlora.py", line 109, in pred…
-
## Typology of Efficient Training
- Data & Model Parallel
  - Data Parallel (see the sketch after this list)
  - Tensor Parallel
  - Pipeline Parallel
  - Zero Redundancy Optimizer (ZeRO) (DeepSpeed, often works with CPU offloadi…
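
As referenced in the list, here is a minimal sketch of plain data parallelism with PyTorch DDP. The model, tensor sizes, and hyperparameters are placeholders chosen only to keep the example self-contained; this is an illustration, not a recipe from this document.

```python
# Hedged sketch: plain data parallelism with PyTorch DDP (placeholder model and sizes).
# Launch with: torchrun --nproc_per_node=<num_gpus> ddp_sketch.py
import os
import torch
import torch.distributed as dist
from torch.nn.parallel import DistributedDataParallel as DDP

def main():
    dist.init_process_group(backend="nccl")       # one process per GPU
    local_rank = int(os.environ["LOCAL_RANK"])    # set by torchrun
    torch.cuda.set_device(local_rank)

    model = torch.nn.Linear(1024, 1024).cuda()    # placeholder model
    model = DDP(model)                            # replicate weights; all-reduce gradients
    opt = torch.optim.AdamW(model.parameters(), lr=1e-4)

    for _ in range(10):                           # dummy training steps
        x = torch.randn(8, 1024, device="cuda")
        loss = model(x).pow(2).mean()
        loss.backward()                           # gradients averaged across ranks here
        opt.step()
        opt.zero_grad()

    dist.destroy_process_group()

if __name__ == "__main__":
    main()
```

Tensor and pipeline parallelism instead split the model itself across devices, and ZeRO shards optimizer state, gradients, and (at stage 3) parameters across data-parallel ranks.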
-
When I run the qlora example with oneAPI 2024 installed, it reports an error that libsycl.so.7 cannot be found.
```
warnings.warn(
0%| | 0/20…
```
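
A hedged check to see whether the SYCL runtime is visible to the process at all; the library name comes from the error above, and the suggestion to source the oneAPI environment first is an assumption about the setup.

```python
# Hedged sketch: try to dlopen the missing SYCL runtime.
# Typically run after `source /opt/intel/oneapi/setvars.sh` (assumed default install path).
import ctypes

try:
    ctypes.CDLL("libsycl.so.7")
    print("libsycl.so.7 found and loadable")
except OSError as err:
    print(f"libsycl.so.7 not loadable: {err}")
    print("Check that the oneAPI environment is sourced and LD_LIBRARY_PATH includes its runtime libraries.")
```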
-
We are trying to fine-tune ChatGLM-6B using LoRA on an Arc A770, with 1 card and with 2 cards, using the following commands.
1 card:
```
python ./alpaca_lora_finetuning.py \
--base_model "/home/intel/models/chat…