-
Here is a small write-up I've built and use:
- https://gist.github.com/andy108369/c487dcd784d93a29e7edca805dd5be57
```
(.venv) root@node2:~# huggingface-cli download meta-llama/Meta-Llama-3.1-…
```
-
### System Info
CPU: x86_64
GPU: L40s
TensorRT branch: main
commit id: b57221b764bc579cbb2490154916a871f620e2c4
CUDA:
| NVIDIA-SMI 535.154.05 Driver Version: 535.154.05 CUDA V…
-
I tested nfloat4 quite a bit in OneTrainer, and the results are basically the same as sd-scripts, but with almost 9 GB less VRAM.
I was wondering if it's possible for you to implement it in your scrip…
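For anyone curious what this would involve, here is a minimal sketch of blockwise 4-bit codebook quantization, the idea behind NF4. The 16 levels below are illustrative stand-ins, not the exact NF4 table from bitsandbytes (which is derived from quantiles of a standard normal distribution), and the function names are made up:

```python
# Sketch of blockwise 4-bit codebook quantization (the idea behind NF4).
# CODEBOOK values are illustrative, not the real bitsandbytes NF4 table.
CODEBOOK = [-1.0, -0.70, -0.53, -0.39, -0.28, -0.18, -0.09, 0.0,
            0.08, 0.16, 0.25, 0.34, 0.44, 0.56, 0.72, 1.0]

def quantize_block(block):
    """Scale a block by its absmax, then snap each value to the nearest level."""
    absmax = max(abs(x) for x in block) or 1.0
    indices = [min(range(16), key=lambda i: abs(x / absmax - CODEBOOK[i]))
               for x in block]
    return indices, absmax  # 4 bits per weight plus one scale per block

def dequantize_block(indices, absmax):
    return [CODEBOOK[i] * absmax for i in indices]

weights = [0.5, -2.0, 1.0, 0.0]
idx, scale = quantize_block(weights)
restored = dequantize_block(idx, scale)
```

The memory saving comes from storing one 4-bit index per weight plus a single scale per block, instead of 16 bits per weight.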
-
Small LLMs trained with FP8 on 32 GPUs can achieve a 20~30% speedup compared with bf16.
However, scaling up to 1000+ GPUs achieves less than a 5% speedup (TP2 PP4 VP4).
Any suggestion to de…
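One way to reason about this: FP8 only accelerates the GEMM-heavy compute portion of a step, while communication and pipeline bubbles are untouched, so the end-to-end gain follows an Amdahl-style law. A back-of-envelope sketch (the fractions and the 1.5x kernel speedup below are made-up illustrative numbers, not measurements):

```python
def fp8_end_to_end_speedup(compute_fraction, kernel_speedup):
    """Amdahl-style estimate: only the FP8-accelerated compute fraction of
    step time shrinks; communication and pipeline bubbles stay the same."""
    return 1.0 / ((1.0 - compute_fraction) + compute_fraction / kernel_speedup)

# Hypothetical numbers: if compute is 70% of step time at 32 GPUs but only
# 15% at 1000+ GPUs (comm and bubbles dominate), a 1.5x FP8 GEMM speedup gives:
small_scale = fp8_end_to_end_speedup(0.70, 1.5)  # ~1.30x end to end
large_scale = fp8_end_to_end_speedup(0.15, 1.5)  # ~1.05x end to end
```

This suggests profiling what fraction of step time is actually spent in FP8-eligible GEMMs at 1000+ GPUs before tuning the recipe further.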
-
```
!!! Exception during processing !!! Sizes of tensors must match except in dimension 2. Expected size 60 but got size 12 for tensor number 1 in the list.
Traceback (most recent call last):
Fil…
-
@kohya-ss @lansing @rockerBOO @akx @tsukimiya @wkpark
Would you consider supporting training for OpenFlux? The OpenFlux model link is: https://huggingface.co/ostris/OpenFLUX.1. Given that Flux and i…
-
[context_flashattention_nopad_fp16_fp8.txt](https://github.com/user-attachments/files/16421521/context_flashattention_nopad_fp16_fp8.txt)
We have implemented an fp8 version of context_flashattention_…
-
Is there a way to run these models with 12 GB of RAM?
With fp8 models it works, but with GGUF models it always fails.
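Whether the weights fit is partly simple arithmetic (weights only; activations, text encoders, and framework overhead add more). A rough sketch, assuming the commonly cited ~12B parameter count for FLUX-class models and a ~4.5 bits-per-weight Q4_K-style GGUF:

```python
def approx_weight_gib(params_billion, bits_per_weight):
    """Back-of-envelope size of the weights alone, in GiB. Ignores
    activations, text encoders, VAE, and framework overhead."""
    return params_billion * 1e9 * bits_per_weight / 8 / 2**30

fp8_size = approx_weight_gib(12, 8)    # ~11.2 GiB
q4_size = approx_weight_gib(12, 4.5)   # ~6.3 GiB for a Q4_K-style GGUF
```

On paper both fit in 12 GB, so a GGUF failing where fp8 works may be loader peak memory or dequantization overhead rather than the steady-state weight size.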
-
Hi, thank you for providing this code.
I am currently running the schnell Q2 model in a Kaggle notebook, but when it starts generating the image it always shows 'using cpu backend' and does not utiliz…
-
ComfyUI is implementing InstantX ControlNets!
"Canny" and "Depth" are working already:
https://huggingface.co/InstantX/FLUX.1-dev-Controlnet-Canny
https://huggingface.co/Shakker-Labs/FLUX.1-de…