llm-pruning Search Results

191 results
for llm-pruning

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

horseee/LLM-Pruner #43

401 Client Error: Unauthorized for url: https://huggingface.…

bash scripts/llama_prune.sh [START] - Start Pruning Model Traceback (most recent call last): File "/home/azuryl/anaconda3/envs/llamaprune/lib/python3.10/site-packages/huggingface_hub/utils/_erro…

azuryl updated 9 months ago
1
langchain-ai/langchain #21610

ConversationSummaryBufferMemory does not work as expected wi…

### Checked other resources - [x] I added a very descriptive title to this issue. - [x] I searched the LangChain documentation with the integrated search. - [X] I used the GitHub search to find a…

Sameera2001Perera updated 1 month ago
4
manisnesan/til #33

LLMs in Production - Conference

Related - https://huyenchip.com/2023/04/11/llm-engineering.html [Tweet thread](https://twitter.com/transitive_bs/status/1646778061160071168?s=46&t=aOEVGBVv9ICQLUYL4fQHlQ) - LLMs in Production host…

manisnesan updated 1 year ago
9
zchen0420/nn_papers #6

Humanlike behaviors

# ChatGPT is fun, but it is not funny! Humor is still challenging Large Language Models 2023 Workshop on Computational Approaches to Subjectivity, Sentiment “oxymoron” Despite being fun to interact …

zchen0420 updated 3 months ago
8
pytorch/ao #47

[RFC] Plans for torchao

### Summary Last year, we released [pytorch-labs/torchao](https://github.com/pytorch-labs/ao) to provide acceleration of Generative AI models using native PyTorch techniques. Torchao added support …

supriyar updated 5 months ago
21
irthomasthomas/undecidability #681

MultiAgentLLM a faithful recreation of the Small LLMs Are We…

- [ ] [RichardAragon/MultiAgentLLM](https://github.com/richardaragon/multiagentllm) # RichardAragon/MultiAgentLLM **DESCRIPTION:** "Multi Agent Language Learning Machine (Multi Agent LLM) (Update)…

irthomasthomas updated 6 months ago
2
genaforvena/skiffs #4

Experiment with a system of two small language models for ge…

The idea is to have a system of two small language models: a producer and a feeder. The producer’s task is to generate Python code based on the instructions given by the feeder. The feeder’s task is t…

genaforvena updated 8 months ago
11
irthomasthomas/undecidability #680

self-speculative-decoding/README.md at main · dilab-zju/self…

- [ ] [self-speculative-decoding/README.md at main · dilab-zju/self-speculative-decoding](https://github.com/dilab-zju/self-speculative-decoding/blob/main/README.md?plain=1) # Self-Speculative Decod…

irthomasthomas updated 7 months ago
1
zchen0420/nn_papers #11

A Few Neurons: High-level Concentration

启发OpenAI去做GPT的单个[sentiment neuron](https://openai.com/index/unsupervised-sentiment-neuron/)，和后面Issue一样，不仅有具体的location还能editing并且操纵模型。[用GPT-4和数据寻找&解释GPT-2的neuron](https://openai.com/index/language-mode…

zchen0420 updated 3 months ago
5
pytorch/pytorch #121465

[RFC] PagedAttention Support

### Feature request PagedAttention has been a mainstream optimization technology for generation task based on LLMs. It has been supported by a lot of server engines, e.g., [vllm](https://github.co…

liangan1 updated 4 months ago
17

上一页 1...2 3 4 5 6 7 8...20 下一页

191 results for llm-pruning

191 results
for llm-pruning