fp8 Search Results - Githubissues

1000+ results for fp8

lllyasviel/stable-diffusion-webui-forge #1030

Forge upgrade issues for the latest 10-series and 20-series …

NF4 model 1024 X 1024 resolution 10 Series 20 Series 8G graphics card, running a picture to take four minutes

dddddssa updated 3 months ago
8
vllm-project/vllm #7517

[Bug]: AutoAWQ marlin methods error

### Your current environment vllm 0.5.4 ### 🐛 Describe the bug autoawq marlin must with no zero point， but vllm： ```python def query_marlin_supported_quant_types(has_zp: bool, …

MichoChan updated 2 weeks ago
8
city96/ComfyUI-GGUF #68

Questions on GGU Q8 Model Performance: T5_FP16 vs T5_Q8 and …

Hello everyone, First off, a big thanks to city96 for the awesome work they've been contributing to the community. It's been incredibly helpful! Here are my system specs: Processor: Intel i5-13…

FerreiraArmando updated 2 weeks ago
3
deepseek-ai/DeepSeek-V2 #21

Reproduce inference benchmark mentioned in the paper

I have a few questions about the inference efficiency of deepseek v2 1. > In order to efficiently deploy DeepSeek-V2 for service, we first convert its parameters into the precision of FP8. Ar…

zhouheyun updated 3 months ago
4
NVIDIA/TransformerEngine #962

nan loss when training in fp8 with rotary embedding

Loss in nan in the first batch of training itself when transformer architecture uses [rotary embedding](https://github.com/lucidrains/rotary-embedding-torch)

saurabh-kataria updated 4 months ago
2
ROCm/hipBLASLt #1297

FP64 missing in the table

A | B | C | D | Compute(Scale)
-- | -- | -- | -- | --
fp32 | fp32 | fp32 | fp32 | fp32
fp16 | fp16 | fp16 | fp16 | fp32
fp16 | fp16 | fp16 | fp32 | fp32
bf16 | bf16 | bf16 | bf16 | fp32
fp8/bf…

jinz2014 updated 1 hour ago
6
ROCm/AMDMIGraphX #2517

FP8 lossy downcast issue with "ref" implementation

https://github.com/ROCmSoftwarePlatform/AMDMIGraphX/pull/2506/files This PR had to disable FP8 tests for the CPU backend. Ref implementation is doing Float -- > Fp8 -- > Float conversion but C…

umangyadav updated 7 months ago
3
comfyanonymous/ComfyUI #4528

I made an fp8 implementation of flux which gets ~3.5 it/s 10…

### Feature Idea Saw the claim on this reddit thread, hopefully the ideas there can also be brought into comfy for even more speedups. https://www.reddit.com/r/StableDiffusion/comments/1ex64jj/i_m…

Charuru updated 2 months ago
9
camac/Swiper #22

Correct way to initiate swiper after FP8?

I'm wondering if there is a correct and less correct way to do this. Should i add swiper before I add the database to a ondisk project will it filter everything then ? Or do I still need to add filt…

xpagedeveloper updated 6 years ago
1
invoke-ai/InvokeAI #6964

[bug]: many models fail to import when coming from Auto1111

### Is there an existing issue for this problem? - [X] I have searched the existing issues ### Operating system Windows ### GPU vendor Nvidia (CUDA) ### GPU model RTX 3060 ### GPU VRAM 12GB …

Jonseed updated 1 month ago
4
