gpt-q Search Results - Githubissues

1000+ results
for gpt-q

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

pytorch/pytorch #138340

[Performance] [CuDNN-Attention] CuDNN backend should return …

# Summary This can have large performance impact in real Attention modules. The most common pattern (derived from nano-gpt) ```Python import torch import torch.nn as nn import torch.nn.funct…

drisspg updated 2 weeks ago
21
YuchuanTian/DiJiang #6

Provided code seems to have O(n x n x d) computational compl…

Provided code calculates matrix product of q and k. https://github.com/YuchuanTian/DiJiang/blob/main/modeling/pythia-2.8B-dijiang/modeling_gpt_neox_dijiang.py#L286 That means it has computational …

bilzard updated 3 months ago
6
RVC-Boss/GPT-SoVITS #1268

【求助】超时_训练30min后超时自动退出

"E:\software\Umi\GPT-SoVITS-beta0217fix3\runtime\python.exe" GPT_SoVITS/s2_train.py --config "E:\software\Umi\GPT-SoVITS-beta0217fix3\TEMP/tmp_s2.json" INFO:zhouwensha:{'train': {'log_interval': 100,…

6lingpei6 updated 2 months ago
2
bionic-gpt/bionic-gpt #635

GUI and Quality Issues

## Application - [x] Highlighted menu item icon is wrong color - [x] history search not working - [ ] BlankSlate flickers, image size? - [ ] Documents upload page doesn't update No of Chunks aft…

242816 updated 6 days ago
4
FENRlR/MB-iSTFT-VITS2 #24

frequency noise

[dali.zip](https://github.com/user-attachments/files/17500253/dali.zip) (279000 steps) This is a synthesized 16kHz Chinese audio, where noise is consistently present at specific frequencies. ![i…

Shenkailai updated 2 weeks ago
3
Lightning-AI/lightning-thunder #310

Dtype mismatch with LitGPT and autocast

## 🐛 Bug ### To Reproduce ```python import thunder from thunder.tests.litgpt_model import GPT import torch device = torch.device("cuda") with device: model = GPT.from_name("llama2-li…

carmocca updated 6 months ago
5
lllyasviel/stable-diffusion-webui-forge #1894

Don't know how to solve this :(

Forge worked just fine yesterday. Today I installed requirement.txt for the mixlab node for ComfyUI and ALL my Forge-"installations" gpt messed up 💩 How is that even possible. Both my venv-one and my…

Kallamamran updated 1 month ago
1
RVC-Boss/GPT-SoVITS #744

开启SoVITS训练时报错：RuntimeError: The size of tensor a (350) must …

开启SoVITS训练时报错： ``` D:\python\lib\site-packages\torch\functional.py:650: UserWarning: stft with return_complex=False is deprecated. In a future pytorch release, stft will return complex tensors for…

SRbone updated 2 months ago
2
vllm-project/vllm #3261

8bit quantization

Does vLLM support 8 bit quantization? We need to use vLLM with large context window (>1K tokens). We tried AWQ but the generation quality is not good. Any pointer will be greatly appreciated.

rghosh08 updated 2 weeks ago
5
pytorch-labs/attention-gym #60

How to do KV Cache with FlexAttention and BlockMask by slici…

Is there any example code to do this? Should I generate new BlockMask everytime? Thanks! ------------------------------ Essentially, I have problem of slicing BlockMask. For exmaple, if we have…

Leo-T-Zang updated 1 hour ago
4

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for gpt-q

1000+ results
for gpt-q