quantizing Search Results

1000+ results
for quantizing

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

TexasInstruments/edgeai-torchvision #5

Quantitative training official edgeai-yolov5 code does not c…

I refer to the official quantitative demo training edgeai-yolov5, but the training did not converge. Quantization training examples/quantization_example.py this demo can converge. Refer to quantiz…

Serissa updated 2 years ago
4
instructlab/instructlab #967

Unable to train instructlab/granite-7b-lab-GGUF model

We are able to download the granite model using below command ilab download --repository instructlab/granite-7b-lab-GGUF --release main --filename granite-7b-lab-Q4_K_M.gguf ilab generate is worki…

skairali updated 1 month ago
10
HKUST-Aerial-Robotics/D2SLAM #44

SuperGlue and NetVLAD stuck with tensorRT

Hi, I tried to speed up CNN inference using tensorRT, but the following two problems occurred. Have these issues been addressed? SuperGlue stuck with warning: ``` [W:onnxruntime:SuperGlueOnnx, …

allegorywrite updated 3 months ago
1
huggingface/swift-transformers #30

Memory efficiency

Hey Guys, This is a great library, but I have a question. Is this library is able to use memory as efficiently as the Llama.cpp library? In otherwords, if I'm using a checkpoint that I use with Llama…

hassanzadeh updated 3 months ago
2
ml-explore/mlx-examples #346

Fused & Uploaded Model Losing Coherence

I noticed today that when I use python -m mlx_lm.generate the output doesn't match what I get locally using python lora.py. For example: Local output using lora adapters: ``` (base) Williams-MacBo…

USMCM1A1 updated 9 months ago
6
mit-han-lab/llm-awq #101

Question about Calibration Data

Hi, I have a question about the calibration data: In [calib_data.py](https://github1s.com/mit-han-lab/llm-awq/blob/HEAD/awq/utils/calib_data.py), you re-organize the calib data so that every batch ha…

rainyBJ updated 7 months ago
8
RobinSchmidt/RS-MET #143

stepped portamento with variable slide? how would you do it?

So, I need to implement stepped portamento as another glide mode. The simplest implementation is simply quantizing the pitch always to the nearest note and pitch changes, but I want to do a kind of "v…

elanhickler updated 6 years ago
8
lyogavin/airllm #117

Is it possible to use AirLLM with a quantized input model?

Hi there! Thanks for this amazing library. I was able to run a 70B model on my M2 Macbook Pro! I was able to get about one token every 100 seconds, which is almost good enough for my overnight task…

Verdagon updated 6 months ago
3
microsoft/LQ-Nets #8

Frequent `Segmentation Fault (Core dumped)`

I am trying to run the code in the usage part of the README file. `python imagenet.py --gpu 0,1,2,3 --data /home/bcrc/Datasets/imagenet --mode pre .......` However, I encountered 'core dump' error f…

zhutmost updated 5 years ago
6
bitsandbytes-foundation/bitsandbytes #1320

where are the outliers stored in LLM.int8 quantization for i…

Hi, I'm using `BitsAndBytesConfig` on HF's Transformers library to quantize `facebook/opt-66B` model. But when I print the dtype of weights of varoius layers, all of them turn out to be of `int8`. …

vbayanag updated 1 month ago
2

上一页 1...14 15 16 17 18 19 20...100 下一页

1000+ results for quantizing

1000+ results
for quantizing