-
I see issue #134 mentions:
> Consider https://github.com/weaviate/weaviate if want to store both vectors and objects in db -- not necessarily wanted in general, but makes db stable against needing …
-
OS: Fedora 38
GPU: RX 6700 XT (rocm)
When trying to run the webui, the compiling of exllama_ext fails with multiple files not found. The second error is repeated 10 times, which I left out of the…
-
Hi,
I get a segmentation fault when using act-order.
Reproduction:
```python
from model import Ex4bitLinear, ExLlamaConfig
import torch
import torch.nn as nn
config = ExLlamaConfig("../…
-
I've used lite.koboldai.net for the past 1~2 weeks, as well as running a worker off-and-on via KoboldAI (version: 0cc4m/koboldai : latestgptq)
(Hardware: GTX 1070 (8GB))
And have noticed some stra…
-
**Is your feature request related to a problem? Please describe.**
Despite building with cuBLAS, `LocalAI` still uses only my CPU by the looks of it.
**Describe the solution you'd like**
Usag…
-
### System Info
MAC OS 13.1 13.1 22C65
Python3.11
### Information
- [X] The official example notebooks/scripts
- [ ] My own modified scripts
### Related Components
- [X] backend
- [ ] bindings
-…
-
Can the non-UI logic in llm-chat.js, such as Class LLMChatInstance, Class LLMChatPipeline, and related dependencies like tvmjs.bundle.js and tvmjs_runtime.wasi.js, be separated and packaged into an np…
-
Sometimes koboldcpp crashes when using `--useclblast`
Not using BLAS or only using OpenBLAS works fine. It only crashes when i add `--useclblast 0 0` to the command line.
I'm not sure if this has …
-
Hello,
I am trying to get llama2 installed on my laptop. I am using MacBook Pro, Apple M2 Max, MacOS Ventura 13.0 (22A8380). I have 32 GB unified memory. "32GB of unified memory makes everything yo…
-
**Description**
Integrate the UI with gpt-index (llama-index) or langchain to greatly extend features
**Additional Context**
https://github.com/jerryjliu/llama_index
https://github.com/hwchase…