-
bash scripts/llama_prune.sh
[START] - Start Pruning Model
Traceback (most recent call last):
File "/home/azuryl/anaconda3/envs/llamaprune/lib/python3.10/site-packages/huggingface_hub/utils/_erro…
-
### Checked other resources
- [x] I added a very descriptive title to this issue.
- [x] I searched the LangChain documentation with the integrated search.
- [X] I used the GitHub search to find a…
-
Related
- https://huyenchip.com/2023/04/11/llm-engineering.html
- [Tweet thread](https://twitter.com/transitive_bs/status/1646778061160071168?s=46&t=aOEVGBVv9ICQLUYL4fQHlQ) - LLMs in Production host…
-
# ChatGPT is fun, but it is not funny! Humor is still challenging Large Language Models
2023 Workshop on Computational Approaches to Subjectivity, Sentiment
“oxymoron” Despite being fun to interact …
-
### Summary
Last year, we released [pytorch-labs/torchao](https://github.com/pytorch-labs/ao) to provide acceleration of Generative AI models using native PyTorch techniques. Torchao added support …
-
- [ ] [RichardAragon/MultiAgentLLM](https://github.com/richardaragon/multiagentllm)
# RichardAragon/MultiAgentLLM
**DESCRIPTION:** "Multi Agent Language Learning Machine (Multi Agent LLM)
(Update)…
-
The idea is to have a system of two small language models: a producer and a feeder. The producer’s task is to generate Python code based on the instructions given by the feeder. The feeder’s task is t…
-
- [ ] [self-speculative-decoding/README.md at main · dilab-zju/self-speculative-decoding](https://github.com/dilab-zju/self-speculative-decoding/blob/main/README.md?plain=1)
# Self-Speculative Decod…
-
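This inspired OpenAI's work on a single GPT [sentiment neuron](https://openai.com/index/unsupervised-sentiment-neuron/). As in the later issue, the neuron not only has a specific location but can also be edited to steer the model. [Using GPT-4 and data to find and explain GPT-2's neurons](https://openai.com/index/language-mode…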
-
### Feature request
PagedAttention has become a mainstream optimization technique for LLM-based generation tasks. It is supported by many serving engines, e.g., [vllm](https://github.co…