-
Supporting a vector database like ChromaDB would have numerous benefits including a longer memory.
-
_Exclude Top Choices_ (XTC) sampling algorithm is a novel sampler that turns truncation on its head: Instead of pruning the least likely tokens, under certain circumstances, **it removes the _most_ li…
-
Any plans to allow integration or compatibility for GPT4ALL?
-
I couldn't reopen my original issue so I hope its fine if I open another bug.
The pascal fix is broken again, at least for me.
The following check does not work:
q4_matmul.cu:
> if defined(__…
-
> **Warning**. Complete **all** the fields below. Otherwise, your bug report will be **ignored**!
**Have you searched for similar [bugs](https://github.com/SillyTavern/SillyTavern/issues?q=)?**
Ye…
ghost updated
3 months ago
-
Hi. I am building a binding for AWQ in lollms and I get this problem after I install it:
ImportError: DLL load failed while importing awq_inference_engine
The error seems to come from this:
aw…
-
### Has this been raised before?
- [X] I have checked [the GitHub README](https://github.com/QwenLM/Qwen2.5).
- [X] I have checked [the Qwen documentation](https://qwen.readthedocs.io) and cannot fin…
-
Hello there!
I would like to bring your attention to this finding: https://github.com/oobabooga/text-generation-webui/issues/2987
It seems like it is possible to reduce the VRAM usage of AutoGPT…
-
I know this is usually unnecessary, but I'd like to help out. Do you have a discord?
I have some ideas about datasets, training that I think could be very useful. I'm also a GDScript veteran at this …
-
Hi,
in the Local-LLM notebook we need to import:
`import sys
sys.argv = [sys.argv[0]]
import importlib
import json
import math
import os
import re
import sys
import time
import tracebac…