auto-quant Search Results

1000+ results
for auto-quant


huggingface/peft #1672

Using PEFT causes model to not predict EOS

### System Info > peft version: 0.9.0 > accelerate version: 0.27.2 > transformers version: 4.37.0 > trl version: 0.7.12.dev0 > base model: openai-community/gpt2 > hardware: 2xA100 I'm doing a…

Km3888 updated 1 month ago
15
Vahe1994/AQLM #109

How to import and use it in my existent code that loads LLMs…

This is the code; I achieved 4 bits with normal libs: import gc import os import re import torch import tensorflow as tf import pandas as pd import matplotlib.pyplot as plt import nltk import …

Kuchiriel updated 1 month ago
3
kadirnar/whisper-plus #101

AttributeError: 'HQQLinear' object has no attribute 'weight'

Thank you for creating this amazing package. It looks very promising. But I am facing an issue installing it on a Linux Mint (Ubuntu) desktop, where I have an NVIDIA RTX 3060 GPU with 12GB VRAM. I …

foduucom updated 3 months ago
7
unslothai/unsloth #348

Loading unsloth/mistral-7b-instruct-v0.2-bnb-4bit error

I am trying to load mistral-7b-instruct-v0.2-bnb-4bit model from unsloth using the following model, tokenizer = FastLanguageModel.from_pretrained( model_name = "models/unsloth/mistral-7b-i…

WillsonAmalrajA updated 4 months ago
1
Daylily-Informatics/bloom #14

Accessioning WF load test findings

## Summary Assess performance of Bloom Db as number of rows scales ## Approach I'm running repeated invocations of `test_create_acceessioning_wf()` and monitoring the size of the `generic_instance`…

adamtracy updated 8 months ago
2
vllm-project/vllm #6576

Fp8 support for mi300x

### 🚀 The feature, motivation and pitch It was not clear for me if the fp8 support is available for rocm. But I got with 5.2 : fp8 quantization is currently not supported in ROCm. There are pla…

ferrybaltimore updated 5 days ago
9
pytorch/ao #208

FP6 dtype!

### 🚀 The feature, motivation and pitch https://arxiv.org/abs/2401.14112 I think you guys are really going to like this. The deepspeed developers introduce FP6 datatype on cards without fp8 suppo…

NicolasMejiaPetit updated 2 months ago
31
vllm-project/vllm #8879

[Usage]: OOM when using Llama-3.2-11B-Vision-Instruct

### Your current environment ```text The output of `python collect_env.py` ``` Collecting environment information... PyTorch version: 2.4.0+cu121 Is debug build: False CUDA used to build PyTorc…

hrson-1203 updated 21 hours ago
20
intel/auto-round #138

question about calib data

Hi, I noticed that the calib data is clipped from the origin input_ids whose length is >= args.seqlen. Can the calib data be generated by packing origin input_ids first and then slicing?

mxjmtxrm updated 2 months ago
15
KevinFire2030/Fire2025 #22

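Chapter 10: Collecting Domestic Stock Data

In this chapter, we look at how to crawl domestic stock data: stock tickers, the constituent stocks of each sector, and the core data for quant investing, namely adjusted prices, financial statements, and valuation metrics.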

KevinFire2030 updated 1 year ago
10

