-
Hey, I am getting this issue while running Flux on Forge on macOS 15.0.1, M3 Pro (Apple silicon).
-
### Your current environment
pip3 install vllm==0.4.2 nvidia-ammo==0.7.1
Collecting environment information...
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch: …
-
## ❓ Question
I'm trying to run the `examples/dynamo/vgg16_fp8_ptq.py` example but got the following error:
```
Traceback (most recent call last):
File "/home/wh/generative_action/SynHSI/vgg_quat.p…
-
https://pytorch.org/docs/stable/notes/amp_examples.html
Currently, `bfloat16` works well without grad scaling. But to use `fp16` and `fp8` (`fp8` - in the future, when the support for Hopper/40XX G…
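For readers unfamiliar with why `fp16` needs grad scaling while `bfloat16` does not: `fp16`'s narrow exponent range lets small gradients underflow to zero, so AMP multiplies the loss by a scale factor before `backward()`, divides the gradients by it before the optimizer step, and skips the step (shrinking the scale) when an overflow shows up. A minimal, framework-free sketch of that dynamic loss-scaling loop (all names here are illustrative, not PyTorch's actual `GradScaler` API):

```python
import math

class LossScaler:
    """Toy dynamic loss scaler mimicking the idea behind torch.cuda.amp.GradScaler."""

    def __init__(self, scale=2.0 ** 16, growth=2.0, backoff=0.5, growth_interval=2000):
        self.scale = scale
        self.growth = growth
        self.backoff = backoff
        self.growth_interval = growth_interval
        self._good_steps = 0

    def scale_loss(self, loss):
        # Multiply the loss so tiny fp16 gradients don't underflow to zero.
        return loss * self.scale

    def step(self, grads, apply_update):
        # Unscale gradients back to their true magnitude before the optimizer sees them.
        unscaled = [g / self.scale for g in grads]
        if any(math.isinf(g) or math.isnan(g) for g in unscaled):
            # Overflow detected: skip this step and back off the scale.
            self.scale *= self.backoff
            self._good_steps = 0
            return False
        apply_update(unscaled)
        self._good_steps += 1
        if self._good_steps % self.growth_interval == 0:
            self.scale *= self.growth  # grow cautiously after a run of clean steps
        return True

# Usage: an overflowed "gradient" skips the step and halves the scale;
# a clean one is unscaled and applied.
scaler = LossScaler()
applied = []
scaler.step([float("inf")], applied.extend)  # skipped; scale drops to 32768.0
scaler.step([1.0], applied.extend)           # applied as 1.0 / 32768.0
```

This skip-and-backoff logic is why `bfloat16` (which shares `fp32`'s exponent range) can usually train without a scaler at all.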
-
### Expected Behavior
The image is rendered.
### Actual Behavior
No image is produced; it just errors out.
### Steps to Reproduce
Using the ComfyUI workflow from the wiki page.
### Debug Logs
```powershell
I can not pu…
-
As a Stable Diffusion user, I need a new torch-directml release for the issues listed here:
Bug issues:
1. Bad memory allocation: GPU memory leaks after every generation; [https://github.com/vladmandic/automati…
-
### Expected Behavior
I'm having a heck of a time getting a working Torch install ... I don't know what happened, but I upgraded (all) and it borked my install. Now when I try a comfy lora/flux workfl…
-
First of all, thank you for sharing your excellent work!
I have a question about overlapping (pingpong design). From my understanding:
1) With FP8 precision and a head dimension of 128, the expo…
-
Hi, I noticed that `FP8LinearStatic` dequantizes the output (fp8) to the input dtype (fp16/bf16), as L209 below shows.
1. Is it because the attention kernel did not support fp8 at that time? And is it…
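For context on what dequantizing an fp8 output back to fp16/bf16 involves: with static per-tensor quantization, a real value `x` is stored as `x / scale` saturated to the fp8 e4m3 representable range (±448), and recovered as `q * scale`. A minimal pure-Python sketch of that round trip (the clamp bound follows the usual e4m3 convention; the function names and calibration value are illustrative, not this repo's code, and fp8 mantissa rounding is omitted):

```python
E4M3_MAX = 448.0  # largest finite value representable in fp8 e4m3

def quantize_static(x, scale):
    """Map a real value onto the fp8 e4m3 range using a precomputed (static) scale."""
    q = x / scale
    return max(-E4M3_MAX, min(E4M3_MAX, q))  # saturate instead of overflowing

def dequantize(q, scale):
    """Recover an approximation of the original value in higher precision."""
    return q * scale

# Calibration picks the scale so the observed amax maps onto the fp8 max.
amax = 12.5                 # e.g. the largest activation seen during calibration
scale = amax / E4M3_MAX

x = 10.0
q = quantize_static(x, scale)
recovered = dequantize(q, scale)   # ~10.0 (exact here, since rounding is omitted)
```

Dequantizing at the layer boundary is the simple way to hand a well-formed fp16/bf16 tensor to the next op; keeping the output in fp8 only pays off if the consumer (e.g. the attention kernel) can ingest fp8 directly.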
-
Hi @hongxiayang @hliuca ,
It seems like float8 training using `torchao.float8` is not supported at the moment. Is there a different library or code path I should be using for float8 training, or what …