-
### Question
Thanks for the great work~
Also, it looks like A800 cannot enable flash-attn. (error screenshot below)
```
python \
llava/train/train.py \
--model_name_or_path /root/dev…
-
Hi,
I'm trying to run guidance within a Gradio app. I'm not sure about the async framework behind Gradio but it seems to be AnyIO and is using some worker threads.
The problem I am experiencing …
-
Hello,
I created a python script to do a retrieval QA from documentation written in markdown. Here is the working program:
```python
from langchain.document_loaders import DirectoryLoader
from…
-
### 提交前必须检查以下项目
- [X] 请确保使用的是仓库最新代码(git pull),一些问题已被解决和修复。
- [X] 我已阅读[项目文档](https://github.com/ymcui/Chinese-LLaMA-Alpaca-2/wiki)和[FAQ章节](https://github.com/ymcui/Chinese-LLaMA-Alpaca-2/wiki/常见问题)并且已…
-
![image](https://github.com/h2oai/h2ogpt/assets/74184102/f09ad7e1-fe6d-44fe-9603-575f525a526c)
Hello!Is there any improvement plan?
-
**Describe the bug**
Hi, everybody, I'm traning a llama model in step3 using deepspeed-chat. In version 0.10.1, it raised the following error([see in logs bleow](https://github.com/microsoft/DeepSp…
-
Setting CPU quantization kernel threads to 6
Using quantization cache
Applying quantization to glm layers
Traceback (most recent call last):
File "/usr/local/lib/python3.8/dist-packages/gradio/r…
-
### System Info
on Python 3.10.10
with requirements.txt
```
pandas==2.0.1
beautifulsoup4==4.12.2
langchain==0.0.229
chromadb==0.3.26
tiktoken==0.4.0
gradio==3.36.1
Flask==2.3.2
tor…
-
### Describe the bug
this is a continuation of https://github.com/oobabooga/text-generation-webui/issues/428
i'm following instruction for one click installer for macos
https://github.com/oobab…
-
I finally made it work
What other parameters and options do we have?
`instruct_pipeline = pipeline(model="F:/Dolly 2.0/dolly-v2-12b", torch_dtype=torch.bfloat16, trust_remote_code=True, device…