quant-seq Search Results

1000+ results
for quant-seq

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

InternLM/lmdeploy #979

cannot start triton server by using llama-2-7b-chat

### Checklist - [ ] 1. I have searched related issues but cannot get the expected help. - [ ] 2. The bug has not been fixed in the latest version. ### Describe the bug cannot start triton server by…

zhulinJulia24 updated 9 months ago
1
shirley-wu/text_to_table #7

Could not replicate results as given in paper

Hi! Great work. I followed the instructions as mentioned in the readme on wikitabletext data, and wasn't able to replicate the results. I trained a HAD model, and ran inference using the test_const…

Rellik-7 updated 1 year ago
10
neuralmagic/sparseml #1603

yolov5 sparsifying model error after update sparseml-nightly…

**Describe the bug** Model is not training due to a pytorch problem **Expected behavior** model should be trained normally. **Environment** I have tested the following on two seperate enviroments…

salwaghanim updated 1 year ago
3
oobabooga/text-generation-webui #177

GPTQ quantization(3 or 4 bit quantization) support for LLaMa

[GPTQ](https://arxiv.org/abs/2210.17323) is currently the SOTA one shot quantization method for LLMs. GPTQ supports amazingly low 3-bit and 4-bit weight quantization. And it can be applied to LLaMa. …

qwopqwop200 updated 1 year ago
215
GirinMan/HYU-Graduation-Project-Quantization #4

[Pytorch] BERT 모델 최적화 reference

Pytorch를 사용해 BERT 모델을 최적화하는데 필요한 Reference를 정리했습니다. 졸업프로젝트 주간 모임을 통해 공부할 예정입니다. 1. [이론] 여러 quantization 방법 - https://jin-choi.tistory.com/18 2. [Pytorch] BERT 소스 코드 이해 - https://hyen4110.tist…

plaire48 updated 1 year ago
2
turboderp/exllama #95

3-bit and 2-bit GPTQ support

Hi! While 3-bit and 2-bit quantisations are obviously less popular than 4-bit quantisations, I'm looking into the possibility of loading 13B models with 8 GB of VRAM. So far, loading a 3-bit 13B model…

TechnotechGit updated 1 year ago
23
BrooksLabUCSC/flair #243

Testing Flair: missing file

**Copy and paste the exact command you tried to run** ~/flair/test$ make test **How did you install Flair?** 1. bioconda (e.g. `conda create -n flair -c conda-forge -c bioconda flair`) 5. git …

RDorney updated 1 year ago
2
fly51fly/aicoco #3

爱可可老师24小时热门分享

微博内容精选

fly51fly updated 1 week ago
1906
onnx/onnx #2836

TracerWarning: Converting a tensor to a Python index might c…

I convert a pytorch [model](https://github.com/mit-han-lab/temporal-shift-module) to onnx. ```python example = torch.rand(10, 3, 224, 224) torch.onnx.export(net, # model being run …

Usernamezhx updated 1 year ago
20
COMBINE-lab/salmon #533

Questions about low mapping rates

I know this problem was reported previously. I checked all the answers and I can see there are many reasons for this. In my case, I have a 'high number of mappings discarded because of alignment score…

ShenTTT updated 1 year ago
5

上一页 1...93 94 95 96 97 98 99...100 下一页

1000+ results for quant-seq

1000+ results
for quant-seq