-
### Context
This task concerns enabling tests for **baichuan2-7b-chat**. More details are available in the openvino_notebooks [LLM chatbot README.md](https://github.com/openvinotoolkit/openvino_notebook…
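For reference, a minimal smoke test could look like the sketch below. This assumes the plain `transformers` load path (the notebook's OpenVINO conversion step is omitted), and the prompt is illustrative:

```python
# Minimal smoke test for baichuan2-7b-chat (sketch: plain transformers path;
# the OpenVINO conversion done in the notebook is omitted here).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "baichuan-inc/Baichuan2-7B-Chat"

tokenizer = AutoTokenizer.from_pretrained(model_id, use_fast=False, trust_remote_code=True)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, trust_remote_code=True
)

inputs = tokenizer("What is OpenVINO?", return_tensors="pt")
output = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```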
-
Why did someone earlier in the thread say they hit an out-of-VRAM error?
-
**Describe the bug**
When fine-tuning my model with deepspeed==0.13.5 and the Hugging Face Trainer, the loss and grad_norm become NaN at step 2.
![image](https://github.com/microsoft/DeepSpeed/assets/29994…
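For context, a minimal sketch of this kind of setup, launched with `deepspeed train.py` (the model id, dummy dataset, and DeepSpeed JSON path below are placeholders, not from the original run). When NaNs appear this early, the fp16 loss-scale is a common first suspect, so bf16 is shown as a commented alternative:

```python
# Sketch of the reported setup: Hugging Face Trainer driven by a DeepSpeed
# config. Model id, dataset, and config path are placeholders.
from datasets import Dataset
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          Trainer, TrainingArguments)

model = AutoModelForCausalLM.from_pretrained("my-base-model")   # placeholder
tokenizer = AutoTokenizer.from_pretrained("my-base-model")      # placeholder

# Tiny dummy dataset so the script is self-contained.
encoded = tokenizer(["hello world"] * 8)
train_dataset = Dataset.from_dict(dict(encoded)).map(
    lambda ex: {"labels": ex["input_ids"]}
)

args = TrainingArguments(
    output_dir="out",
    per_device_train_batch_size=1,
    logging_steps=1,            # log loss/grad_norm every step
    max_grad_norm=1.0,
    fp16=True,                  # early-step NaNs often trace back to fp16
    # bf16=True,                # overflow; trying bf16 is a common first check
    deepspeed="ds_config_zero2.json",  # assumed ZeRO-2 JSON config
)

Trainer(model=model, args=args, train_dataset=train_dataset).train()
```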
-
When running the code from the README:
`tokenizer = AutoTokenizer.from_pretrained("baichuan-inc/Baichuan2-13B-Chat", use_fast=False, trust_remote_code=True)`
I got this error:
-----------------------------…
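The README line alone isn't self-contained; here is a runnable version of it that also logs the environment detail that usually matters for remote-code tokenizers (the version print is an addition for debugging, not from the issue):

```python
# Self-contained version of the README snippet. Baichuan2's tokenizer is
# custom remote code, so the installed transformers version is worth logging
# when reporting errors like the one above.
import transformers
from transformers import AutoTokenizer

print("transformers", transformers.__version__)

tokenizer = AutoTokenizer.from_pretrained(
    "baichuan-inc/Baichuan2-13B-Chat",
    use_fast=False,
    trust_remote_code=True,  # the tokenizer class ships with the model repo
)
print(tokenizer("hello"))
```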
-
### Required prerequisites
- [X] I have read the documentation.
- [X] I have searched the [Issue Tracker](https://github.com/baichuan-inc/baichuan-7B/issues) and [Discussions](https://github.com/bai…
-
Using pad_token, but it is not set yet.
Loading base model for ppo training...
Loading base
Loading lora
Loading ppo
WARNING:root:A model is loaded from '/root/autodl-tmp/LLM/weights/sft_lora', and no v_head weig…
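The warning text matches trl's value-head wrapper, so the load path presumably looks something like the sketch below (assumes trl + peft; the base-model path is a placeholder). Per trl's own message, the missing v_head is expected when starting, rather than resuming, PPO training:

```python
# Sketch of the load path the log suggests: a LoRA SFT checkpoint wrapped
# with a value head for PPO (assumes trl + peft are installed).
from transformers import AutoTokenizer
from trl import AutoModelForCausalLMWithValueHead

base_path = "path/to/base-model"  # placeholder: the SFT base checkpoint
lora_path = "/root/autodl-tmp/LLM/weights/sft_lora"

tokenizer = AutoTokenizer.from_pretrained(base_path, trust_remote_code=True)
if tokenizer.pad_token is None:
    # addresses "Using pad_token, but it is not set yet."
    tokenizer.pad_token = tokenizer.eos_token

# trl detects the peft adapter directory, loads the base model behind it,
# attaches the adapter, and initializes a fresh value head -- hence the
# "no v_head weight is found" warning on a first PPO run.
ppo_model = AutoModelForCausalLMWithValueHead.from_pretrained(lora_path)
```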
-
## Issue 1 on XPU with Python 3.10 [Fixed after releasing bigdl-core-xe and bigdl-core-xe-esimd for Python 3.10]
On Arc14, I followed https://github.com/intel-analytics/BigDL/blob/main/python/llm/exa…
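For reference, a minimal sketch of the bigdl-llm XPU flow that example follows, assuming the Intel GPU wheels are installed (`pip install bigdl-llm[xpu]`); the model id is illustrative:

```python
# Sketch of the bigdl-llm XPU flow: load with 4-bit weights, then move the
# model to the Intel GPU (assumes bigdl-llm[xpu] and its IPEX dependency).
import torch
import intel_extension_for_pytorch as ipex  # noqa: F401  (registers 'xpu')
from bigdl.llm.transformers import AutoModelForCausalLM
from transformers import AutoTokenizer

model_id = "baichuan-inc/Baichuan2-7B-Chat"  # illustrative

model = AutoModelForCausalLM.from_pretrained(
    model_id, load_in_4bit=True, trust_remote_code=True
).to("xpu")
tokenizer = AutoTokenizer.from_pretrained(model_id, trust_remote_code=True)

inputs = tokenizer("What is AI?", return_tensors="pt").to("xpu")
with torch.inference_mode():
    output = model.generate(**inputs, max_new_tokens=32)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```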
-
As the title says.
-
Hello, author! I'm very interested in your work. I took the weights you released and wanted to test your model, but I found the results not very satisfactory. The questions I asked are several that you mention in the documentation; below is my test log.
The llama-7b model is itself very prone to lapsing into nonsense. I'm currently doing work similar to yours, using LoRA fine-tuning on alpaca-7b, and I find the results far better than llama's. The work of expanding the Chinese vocabulary has also already been done by others, and the results after LoRA training improve substantially. I wonder…
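For readers unfamiliar with the recipe being referenced, here is a minimal LoRA setup in the alpaca-lora style using peft. The hyperparameters are the commonly used defaults, not the commenter's exact configuration, and the model id is illustrative:

```python
# Minimal LoRA fine-tuning setup in the alpaca-lora style (values are the
# widely used defaults, not the commenter's exact configuration).
from peft import LoraConfig, get_peft_model
from transformers import AutoModelForCausalLM

base = AutoModelForCausalLM.from_pretrained("huggyllama/llama-7b")  # illustrative id

lora_config = LoraConfig(
    r=8,                                   # low-rank dimension
    lora_alpha=16,                         # scaling factor
    lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"],   # LLaMA attention projections
    task_type="CAUSAL_LM",
)

model = get_peft_model(base, lora_config)
model.print_trainable_parameters()  # LoRA trains well under 1% of the weights
```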