-
> Updated
> - 2023.7.13: 增加 baichuan-13B-Chat、InternLM 模型
> - 2023.6.25: 增加 ChatGLM2-6B、Vicuna-33B-v1.3 模型
> - 2023.6.24: 增加 MPT-30B/MPT-30B-Chat 模型
## 模型推理
建议使用通用的模型推理工具包运行推理,一般都提供较好的UI以及…
-
As a Windows user, I tried to compile this and found the problem was on these two files "```flash_fwd_launch_template.h```" and "```flash_bwd_launch_template.h```". below "```./flash-attention/csrc/fl…
-
A new set of 7b foundational models that claim to beat all 13b Llama 2 models in benchmarks.
https://huggingface.co/mistralai/Mistral-7B-v0.1
https://huggingface.co/mistralai/Mistral-7B-Instruct-v…
-
Is it possible to provide an API the mimics the functionality of the OPENAI API?
-
### Environment
🪟 Windows
### System
windows 10
### Describe the problem
making the inital install the program runs properly.
however on subsequent loads it then has this `error: Mod…
-
I try to use Koboldcpp's OpenAI compatible API in the Custom Local (OpenAI format) section, but it is not working. I input the model name, protocol and the port number. Please let me know if you need …
-
Hi, nice tool!
The installation went flawlessly.
I then tried the "Research" stream and it returns the error `AttributeError: 'NoneType' object has no attribute 'split'`.
Here is the bash console…
-
is there a way to run with AMD GPU ?
-
**Description**
For now, only the last system prompts are used on generating response while other previous prompts are ignored. As official OpenAI api supports multiple system prompts for input, su…
-
Offline LLMs + online browsing if available is a use case for private agents.
GPT4All.io has an easy installer and runs on CPU on most PCs.
Vicuna https://vicuna.lmsys.org - GPT-4 with ~90% Chat…
vRobM updated
10 months ago