-
I still cannot understand which option (w_elem_format_bp, a_elem_format_bp, a_elem_format_bp_ex, a_elem_format_bp_os) represents the gradient.
In fact, in the BP process, I wish to set the gradient as…
-
### System Info
GPU name (NVIDIA A6000)
TensorRT-LLM tag (v0.9.0 main)
transformers tag (0.41.0)
### Who can help?
@nc
### Information
- [X] The official example scripts
- [X] My own modified…
-
Can you add this model for generative fill?
https://github.com/black-forest-labs/flux
-
Hi - while I was waiting for you to fix the PuLID Flux, I thought I would try the Maker dual setting with the SDXL base, but I got this error:
![image](https://github.com/user-attachments/assets/640552…
-
### Expected Behavior
I expect no issues. I installed ComfyUI fresh a couple of days ago with no issues, at ~4.6 seconds per iteration.
### Actual Behavior
After updating, I'm now experiencing 20 seconds p…
-
I'm using an RTX 4090 GPU to run a 5B model, but I keep getting out-of-memory errors. I'm using the cogvideox_5b_example_01 workflow from the examples. What could be the reason?
![image](https://githu…
-
### System Info
CPU architecture: x86_64
Host RAM: 1TB
GPU: 2xL20 SXM
Container: Manually built container with TRT 9.3 Dockerfile.trt_llm_backend
TensorRT-LLM version: 0.12.0.dev2024070200
Dr…
-
### Your question
It was working fine yesterday, but now I am getting this error... I don't know why. It's my first time using an image-generation model, so I don't know what to do. It was working fine…
-
### What behavior of the library made you think about the improvement?
I have just started to use Outlines, and my use case is that I am hosting a local model on a server using [Serve with vLLM](http…
-
Trying to replicate the benchmark by following [the official guide](https://nvidia.github.io/TensorRT-LLM/performance.html) for Llama2-7b with the latest release `0.7.1` and the Triton server image `23.12-trt…