-
Source:
We need to update both [model_prices_and_context_window.json](https://github.com/BerriAI/litellm/blob/2a5624af471284f174e084142504d950ede2567d/model_prices_and_context_window.json) and [mo…
-
Just wanted to bring this to your attention as I saw some of the code comments about context limitations:
https://huggingface.co/MaziyarPanahi/Llama-3-8B-Instruct-32k-v0.1-GGUF
-
This issue describes a new high-level entity called "**Projects**".
_Loosely inspired by Claude's_: https://www.anthropic.com/news/projects
- **Projects** could include:
- System prompt
- On…
-
### Code of Conduct
- [X] I have read and agree to the GitHub Docs project's [Code of Conduct](https://github.com/github/docs/blob/main/.github/CODE_OF_CONDUCT.md)
### What article on docs.github.co…
-
Right now, gptel lets you add files and buffers to the context. But if I understand right, from from the code of functions like `gptel-context--insert-buffer-string` and `gptel-context--file-string`, …
-
It seems the Spokestack framework expects an RNN as model and only passes a single frame to the model at a time. In the paper, the WaveNet model expects a time context of 182 frames (1.83s). Will we b…
-
Are Emergent Abilities in Large Language Models just In-Context Learning?
This paper suggests that emergence is the "result from a combination of in-context learning, model memory, and linguistic k…
-
So when mapping a view there is a workaround that can be taken.
* when creating/installing the db dont map the view (to avoid EF creating a table named by the view)
* when using the db map the …
-
I want to quantize the CodeQwen model using a custom dataset, but all sample lengths exceed 512. Why doesn't AWQ support sample with lengths longer than 512? Are there any alternative methods for quan…
-
https://github.com/FlagOpen/FlagEmbedding/tree/master/Long_LLM/activation_beacon
https://huggingface.co/namespace-Pt/activation-beacon-llama2-7b-chat/tree/main
https://arxiv.org/abs/2401.03462
Cu…