llm-compression Search Results

508 results
for llm-compression

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

microsoft/LLMLingua #78

Getting errors when running phi2

I'm getting the following error: ``` Special tokens have been added in the vocabulary, make sure the associated word embeddings are fine-tuned or trained. Loading checkpoint shards: 100%|████████…

TempusFugit05 updated 6 months ago
1
jina-ai/GSoC #18

Research about deploying LLM with Jina

### *Project idea 3: Research about deploying LLM with Jina* | info | details | | ---------------- | ------------------------------…

Nick17t updated 6 months ago
14
NVIDIA/TensorRT-Model-Optimizer #14

Tried to apply PTQ to a basic CV CNN network and got slower …

I used mtq.INT8_default_CFG as recommended for CNN networks (mtq.quantize(model, config, forward_loop). My initial model ran at 80FPS after quantization it dropped to 40FPS? I checked the model struct…

tmagcaya updated 5 days ago
8
microsoft/LLMLingua #74

llama instead of gpt

Just a few questions about using LLMLingua. 1. How do I adjust the code so that I am using Llama instead of GPT? 2. The reason I am using Llama instead of GPT is because I don't want my data to be s…

jwahnn updated 6 months ago
3
microsoft/LLMLingua #48

PromptCompressor error - OpenAIGPTLMHeadModel.forward() got …

I am trying to use OpenAI GPT-2 model for prompt compression. However, getting error "OpenAIGPTLMHeadModel.forward() got an unexpected keyword argument 'past_key_values'". Has anyone faced/facing simi…

manojsharmadcx updated 6 months ago
3
waifuoid/llmlingua-api #1

Need client demo

The code is concise and very helpful. Please provide a demo for the client.

huyinguo updated 4 months ago
1
kjappelbaum/gptchem #25

Inaccessible examples

Hi, I wanted to run some of the examples, but it seems like the dropbox links inside [gptchem/data.py](https://github.com/kjappelbaum/gptchem/blob/main/src/gptchem/data.py) only lead to dropb…

nicodomschke updated 2 months ago
4
oneapi-src/oneDNN #1971

New/other Matrix multiplication algorithm implementation

Hello, 1. The current implementation for matrix multiplication uses BRGEMM algorithm. Is there any implementation of "Low Rank Approximation approach" for matrix multiplication in oneDNN? Is there a…

vineel96 updated 1 week ago
9
konabuta/my-scratch-book #14

Blog: Techniques for training large neural networks

## Blog: Techniques for training large neural networks Link: https://openai.com/research/techniques-for-training-large-neural-networks This blog explains the parallism technologies for training la…

konabuta updated 4 months ago
6
OpenDevin/OpenDevin #3041

[Bug]: Unable to SSH into session when using "evaluation" sc…

### Is there an existing issue for the same bug? - [X] I have checked the troubleshooting document at https://docs.all-hands.dev/modules/usage/troubleshooting - [X] I have checked the existing iss…

priyanshu-kumar-256 updated 5 days ago
19

上一页 1...2 3 4 5 6 7 8...51 下一页

508 results for llm-compression

508 results
for llm-compression