-
Now that I have my GPU used by localai I wanted to try whisper locally via `:GpWhisper` after installing sox and I got a not very helpful:
```
Gp: Whisper query exited: 2, 0
```
I had installed …
teto updated
2 months ago
-
### Describe the issue
Thank you for the amazing work!
1. Does the model store the whole kv-cache of prefilling and generation on device? If so, how can the device hold the memory of 1M kv value…
-
Hi @philschmid,
When I try to increase the chunk length to be greater than 2048, the training fails and runs into an OOM error on g5.4xlarge.
Totally makes sense why it's happening, my question i…
-
Designing a Multi-Layered Hierarchy of Control
You
I'm working on a idea for a multi-layered hierarchy of control
Copilot
That sounds like an interesting project! A multi-layered hierarchy of co…
-
Possibly related to #4177 but it also seems sufficiently different…
## Expected Behaviour
When I enter attachment view of a message that's forwarding another message, I can hit `` on the `…
-
### Describe the bug
When using Langchain ContextualCompressionRetriever, "run not found" was raised.
```
Traceback (most recent call last):
File "/lib/python3.11/site-packages/langfuse/cal…
-
### Checked other resources
- [X] I added a very descriptive title to this issue.
- [X] I searched the LangChain.js documentation with the integrated search.
- [X] I used the GitHub search to find a …
-
**Describe the bug**
A clear and concise description of what the bug is.
When trying to quantize the StarCoder2 models, I run into a index error due to estimates of the quantization. Specifically,…
-
javascript weekly news
-
Great thanks to the authors of this project!
Bytedance's [TiTok](https://arxiv.org/pdf/2406.07550) use 1d codebook achieves impressive 256x256 to 32 token super high compression ratio, this is ver…