-
近段时间 Apple silicon 平台的机器学习支持在开源社区的努力下快速发展,而苹果芯片的统一内存架构也让大模型的落地普及有了新的希望,因此本项目也推送了新版代码,增加对 macOS GPU 加速框架 MPS(Metal Performance Shaders)的支持。
不过 Apple silicon 诞生不到三年,其机器学习生态更是刚刚起步,一定存在许多问题。本 issue 用于记…
-
### Describe the bug
I am not able to generate text using AWQ models since i updated.
I am able to load the model but once I write something no text gets returned and the console displays an Attr…
-
Hi, @vasilecampeanu !
While GPT-* models are awesome, there's an issue of privacy arising from using an external third party API, especially when the information may be sensitive. More than that, t…
-
Hi,
Thanks to the great work of the authors of AWQ, maintainers at [TGI](https://github.com/huggingface/text-generation-inference), and the open-source community, AWQ is now supported in TGI ([link…
-
# Trending repositories for C#
1. [**jasontaylordev / CleanArchitecture**](https://github.com/jasontaylordev/CleanArchitecture)
__Clean Architecture Solution Template for ASP.NET …
-
I'm not sure how important it is to support Windows (how many Lean users actually use Windows?)
-
# Trending repositories for C#
1. [**veler / DevToys**](https://github.com/veler/DevToys)
__A Swiss Army knife for developers.__
182 stars today | 15,286 stars | 821 fork…
-
This would be useful to use apps other than ollama, as there are tons of backend apps and servers that are openai api compatible.
-
## Problem
Hello, I'm getting this weird cublasLt error on a lambdalabs H100 with cuda 118, pytorch 2.0.1, python3.10 Miniconda while trying to fine-tune a 3B param open-llama using LORA with 8bit …
-
- [ ] [Guide to choosing quants and engines : r/LocalLLaMA](https://www.reddit.com/r/LocalLLaMA/comments/1anb2fz/comment/kprbduc/)
# Guide to choosing quants and engines : r/LocalLLaMA
**DESCRIPTIO…