-
Hi,
thanks for sharing the code. I have tried to use your repo with `bitsandbytes` for model quantization. Unfortunately, the training process does not work: the layers defined in `modelling_llama.p…
-
Hi, thank you for providing the 1.58-bit implementation. Nice work! I looked through many BitNet 1.58 implementations and noticed that they all use the method suggested in "The Era of 1-bit LLMs: Training…
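For readers comparing implementations: the weight-quantization step that the paper describes is absmean ternary rounding (scale by the mean absolute weight, then round-and-clip to {-1, 0, +1}). Below is a minimal sketch of that idea; the function name and the `eps` guard are my own choices, not taken from any of the repos discussed:

```python
def absmean_quantize(weights, eps=1e-5):
    """Quantize a list of float weights to {-1, 0, +1} using the
    absmean scheme from the BitNet b1.58 paper (simplified sketch)."""
    # Scale factor: mean absolute value of the weights.
    gamma = sum(abs(w) for w in weights) / len(weights)
    scale = gamma + eps  # eps avoids division by zero for all-zero weights
    # Round each scaled weight to the nearest integer, clipped to [-1, 1].
    quantized = [max(-1, min(1, round(w / scale))) for w in weights]
    return quantized, scale

q, s = absmean_quantize([0.9, -0.05, 0.4, -1.2])
# q == [1, 0, 1, -1]; small weights collapse to 0, large ones saturate at ±1
```

In the real training setup the ternary weights are used in the forward pass while full-precision weights are kept for the gradient update (straight-through estimator); this sketch only shows the rounding rule itself.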
-
```
mlc-ai-nightly-cu122   0.15.dev404
mlc-llm-nightly-cu122  0.1.dev1355
transformers           4.41.2
```
```
git clone https://huggingface.co/THUDM/glm-4-9b-chat
mlc_llm convert_we…
```
-
## Introduction
This document outlines a high-level proposal for providing efficient, yet easy-to-use k-NN in OpenSearch in low-memory environments. Many more details to come in individual compone…
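For context, a common way to cut k-NN memory is to quantize the stored vectors, e.g. to one byte per dimension instead of a 4-byte float (~4x smaller), and search over the dequantized approximations. The sketch below is illustrative only; it is not the OpenSearch implementation, and all names are made up. Real systems typically share scale parameters across the index rather than per vector:

```python
def quantize_vector(vec, levels=255):
    """Scalar-quantize a float vector to 8-bit codes (illustrative sketch).
    Each dimension is stored as one byte plus a per-vector (lo, step) pair."""
    lo, hi = min(vec), max(vec)
    step = (hi - lo) / levels or 1.0  # avoid step == 0 for constant vectors
    codes = [round((x - lo) / step) for x in vec]
    return codes, lo, step

def dequantize_vector(codes, lo, step):
    """Reconstruct an approximate float vector from its 8-bit codes."""
    return [lo + c * step for c in codes]

def knn(query, quantized_db, k=2):
    """Brute-force k-NN over quantized vectors using approximate
    (dequantized) squared L2 distance; returns indices of the k nearest."""
    def dist2(entry):
        v = dequantize_vector(*entry)
        return sum((a - b) ** 2 for a, b in zip(query, v))
    order = sorted(range(len(quantized_db)), key=lambda i: dist2(quantized_db[i]))
    return order[:k]

db = [[0.0, 0.1], [5.0, 5.2], [0.2, 0.0]]
index = [quantize_vector(v) for v in db]
print(knn([0.1, 0.1], index))  # vectors 0 and 2 are closest to the query
```

The trade-off is the usual one for low-memory vector search: smaller codes mean coarser distances, so recall can drop unless results are re-ranked against full-precision vectors.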
-
Please make sure that this is a feature request. As per our [GitHub Policy](https://github.com/tensorflow/tensorflow/blob/master/ISSUES.md), we only address code/doc bugs, performance issues, feature …
-
I trained taming-transformers on my own dataset and got the ckpt file and the corresponding yaml file. When I apply it to vq-diffusion, an error is reported. I followed `configs/imagenet.yaml`. …
-
## Where are we?
Exporting a PyTorch model for the ExecuTorch runtime goes through multiple AoT (Ahead of Time) stages.
At a high level, there are 3 stages.
1. `exir.capture`: This captures the model's graph …
-
### 🐛 Describe the bug
Hello,
I am running llama3-70b and mixtral with vLLM on a number of different kinds of machines. I encountered wildly different output quality on A10 GPUs vs A100/H…
-
Once we have trained the quantized model, how do we deploy it?
-
```
(chatglm) n:\github\GLM-4>python openai_api_lby.py
2024-06-12 15:24:16,061 - Start initialize model...
Special tokens have been added in the vocabulary, make sure the associated word embeddings are …
```