-
A770, Ubuntu system
```bash
for n in $(seq 16 16); do
  echo "Model= $MODEL RATE= 0.7 N= $n..."
  python3 benchmark_vllm_throughput.py \
    --backend vllm \
    --m…
```
-
Hi! I was wondering whether there is any way in Caffe to compress a neural network.
In this paper [Deep compression: Compressing Deep Neural Networks with Pruning, Trained Quantization and Huffman…
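Caffe has no built-in compression pass, but the magnitude-pruning step at the core of the Deep Compression paper is easy to sketch on raw weight arrays. The sketch below is plain NumPy, not a Caffe API; the function name and threshold scheme are illustrative assumptions:

```python
import numpy as np

def magnitude_prune(weights, sparsity=0.9):
    """Zero out the smallest-magnitude weights until roughly `sparsity`
    fraction of the entries are zero (the pruning step of Deep Compression).
    Illustrative helper, not part of Caffe."""
    flat = np.abs(weights).ravel()
    k = int(sparsity * flat.size)
    if k == 0:
        return weights.copy(), 0.0
    # k-th smallest magnitude becomes the pruning threshold
    threshold = np.partition(flat, k - 1)[k - 1]
    mask = np.abs(weights) > threshold
    return weights * mask, 1.0 - mask.mean()

rng = np.random.default_rng(0)
w = rng.standard_normal((64, 64)).astype(np.float32)
pruned, achieved = magnitude_prune(w, sparsity=0.9)
print(f"achieved sparsity: {achieved:.2f}")
```

In the paper this is followed by retraining the surviving weights, then quantization and Huffman coding; the pruning alone already gives most of the size reduction once the sparse matrix is stored in compressed form.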
-
Hello everyone,
First off, a big thanks to city96 for the awesome work they've been contributing to the community. It's been incredibly helpful!
Here are my system specs:
Processor: Intel i5-13…
-
### 1. System information
Colab, as of 2023-10-23
### 2. Code
Please see the attached Colab notebook here:
https://colab.research.google.com/drive/1yUD0nDu8oeeDtQBa7xCbQWx_w8PxS4UC?usp=sharin…
-
**What would you like to be added/modified**:
A benchmark suite for large language models deployed at the edge using KubeEdge-Ianvs:
1. Interface Design and Usage Guidelines Document;
2. Implem…
-
### Documentation issue/request
There is no useful information on quantization. How do I perform it? What settings should I choose for the different quantization types Q8, Q5 (and what would be the difference i…
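As rough intuition for what the bit counts in names like Q8 and Q5 imply: fewer bits means fewer representable levels and a larger reconstruction error. The sketch below is plain per-tensor uniform quantization, not the actual block-quantized file formats those names refer to:

```python
import numpy as np

def quantize_dequantize(x, bits):
    """Uniform symmetric quantization to signed `bits`-bit integers and back.
    A toy model of the precision loss only, not a real Q8/Q5 format."""
    qmax = 2 ** (bits - 1) - 1           # 127 for 8 bits, 15 for 5 bits
    scale = np.max(np.abs(x)) / qmax
    q = np.clip(np.round(x / scale), -qmax, qmax)
    return q * scale

rng = np.random.default_rng(0)
w = rng.standard_normal(10_000).astype(np.float32)
for bits in (8, 5):
    err = np.mean((w - quantize_dequantize(w, bits)) ** 2)
    print(f"{bits}-bit mean squared error: {err:.2e}")
```

Going from 8 to 5 bits multiplies the step size by 8, so the mean squared error grows by roughly 64x; the practical trade-off is file size and memory versus that loss of fidelity.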
-
### Describe the issue
MatMul in ONNX OpSet 13 started to support bf16 (https://onnx.ai/onnx/operators/onnx__MatMul.html)
However, we don't see an implementation for bfloat16 in the CPU EP for …
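For reference, what a bf16 MatMul would compute can be emulated in NumPy by truncating float32 mantissas to bfloat16 precision before the multiply. This is not the ONNX Runtime implementation; it assumes bf16 inputs with float32 accumulation, and uses simple truncation where real kernels typically round to nearest even:

```python
import numpy as np

def to_bf16(x):
    """Emulate bfloat16 by zeroing the low 16 bits of each float32
    (truncation; real hardware usually rounds to nearest even)."""
    u = x.astype(np.float32).view(np.uint32)
    return (u & np.uint32(0xFFFF0000)).view(np.float32)

def matmul_bf16(a, b):
    # Inputs reduced to bf16 precision; the accumulation stays in
    # float32, as is common for CPU bf16 kernels.
    return to_bf16(a) @ to_bf16(b)

rng = np.random.default_rng(0)
a = rng.standard_normal((4, 8)).astype(np.float32)
b = rng.standard_normal((8, 3)).astype(np.float32)
print(np.max(np.abs(matmul_bf16(a, b) - a @ b)))
```

With only 8 mantissa bits the inputs carry roughly 2-3 decimal digits, so the deviation from the float32 result is small but measurable; that precision trade is exactly why bf16 support is a per-kernel decision in each execution provider.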
-
When I try to train the VQVAE on my own data, I find that the loss for VQVAE training is only the reconstruction loss: https://github.com/PKU-YuanGroup/Open-Sora-Plan/blob/fdc786bc8e52d6386fb32c833eba0b4db286ca7b/o…
-
Hi there,
I applied transformix with trained parameters to my labels and got the transformed label with some "quantization noise", which I suspect comes from data-type conversion to floating point.
So is…
-
Hi there,
According to the documentation
https://github.com/analogdevicesinc/ai8x-training#quantization-aware-training-qat
we can use either QAT or post-training quantization, but can I use both of them? If …
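For background, the two techniques differ in when the rounding happens: QAT simulates the quantize-dequantize step inside the forward pass during training, while post-training quantization rounds the finished weights once. A generic sketch of both (plain NumPy, not the ai8x-training API) also shows that applying the same scheme twice changes nothing:

```python
import numpy as np

def qdq(w, bits=8):
    """Quantize to signed `bits`-bit integers and back
    (symmetric, per-tensor). Illustrative, not the ai8x scheme."""
    qmax = 2 ** (bits - 1) - 1
    scale = np.max(np.abs(w)) / qmax
    return np.clip(np.round(w / scale), -qmax, qmax) * scale

rng = np.random.default_rng(1)
w = rng.standard_normal(16).astype(np.float32)

# QAT: the forward pass uses the quantized-dequantized weights, so the
# training loss already reflects the rounding error and the optimizer
# can compensate for it.
w_qat_forward = qdq(w, bits=8)

# Post-training quantization: round the final weights once, after
# training, with no chance to compensate.
w_ptq = qdq(w, bits=8)

# Re-quantizing already-quantized weights under the same scheme is a
# no-op, which is why "both" usually just means QAT followed by export.
w_twice = qdq(w_ptq, bits=8)
print(np.allclose(w_ptq, w_twice))
```

Whether combining them helps in a given toolchain depends on whether the two steps use the same quantization scheme; if they don't, the second rounding can add error on top of what QAT already absorbed.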