bfloat16 Search Results

1000+ results
for bfloat16

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

keras-team/keras #18448

`mixed_bfloat16` in TPU is slower than `float32`

In short, we observed `mixed_bfloat16` in TPU is slower than `float32` in our model benchmarks. Please refer to this [sheet](https://docs.google.com/spreadsheets/d/1TPwbe8p6eD61arkoIXQnPHf3rgFIDFUZCot…

chenmoneygithub updated 3 days ago
3
IDEA-Research/Grounded-SAM-2 #38

ms_deform_attn_forward_cuda" not implemented for 'BFloat16

Hello! This is the problem when I use `grounded_sam2_local_demo.py` for image inference

ChinChyi updated 3 weeks ago
7
KohakuBlueleaf/LyCORIS #215

GLoRA inference fails with Flux due to weights being in bflo…

My simple inference script is failing when calling wrapper.merge_to() with Flux Dev as the base model. ``` 2024-09-21 19:27:53|[LyCORIS]-INFO: Loading Modules from state dict... 2024-09-21 19:27:…

mhirki updated 1 week ago
1
NVIDIA/cutlass #1757

`02_pytorch_extension_grouped_gemm.ipynb` No kernel configu…

**Describe the bug** I followed [02_pytorch_extension_grouped_gemm.ipynb](https://github.com/NVIDIA/cutlass/blob/main/examples/python/02_pytorch_extension_grouped_gemm.ipynb). And I change dtype from…

hxdtest updated 4 days ago
3
AcademySoftwareFoundation/openvdb #1892

[BUG] failed to build fVDB with cuda-12.2

### Environment **Operating System:** Linux NixOS **Version / Commit SHA:** fVDB **Other:** gcc 10.5.0 ### Describe the bug I'm trying to build fVDB with CUDA 12.2, but the build fails with t…

yzx9 updated 5 days ago
2
balazik/ComfyUI-PuLID-Flux #11

CUDA ERROR

got prompt !!! Exception during processing !!! No operator found for `memory_efficient_attention_forward` with inputs: query : shape=(1, 577, 16, 64) (torch.bfloat16) key : …

abozahran updated 3 hours ago
1
ROCm/ROCm #2534

Difference between hip_bfloat16 and __hip_bfloat16?

I'm using ROCm 5.7. Currently there are two datatypes for `bfloat16` -- `hip_bfloat16` and `__hip_bfloat16`. They seem to be defined respectively as ``` struct __hip_bfloat16 { unsigned short d…

pcmoritz updated 9 months ago
10
NVIDIA/TensorRT-LLM #1957

Model Performance Degraded when using BFLOAT16 LoRa Adapters

### System Info 2X L4 GPUs Docker Image: nvcr.io/nvidia/tritonserver:24.06-trtllm-python-py3 ### Who can help? @juney-nvidia @kaiyux ### Information - [ ] The official example sc…

TheCodeWrangler updated 2 weeks ago
8
NVIDIA/TensorRT-LLM #2247

Invalid MIT-MAGIC-COOKIE-1 key

### System Info - OS: Ubuntu 20.04 - GPU: RTX 2080TI ### Who can help? @byshiue @ncomly-nvidia ### Information - [x] The official example scripts - [ ] My own modified scripts ### Tasks - [x] …

sherlcok314159 updated 5 days ago
4
Fanghua-Yu/SUPIR #33

bfloat16 error

Hi I'm testing the local install & interface Dr. Furkan Gözükara made for Supir and its its working really well on a 4090 but i get the following error when i try to use it on an RTX8000. RuntimeE…

FabricatedGirls updated 7 months ago
7

上一页 1...2 3 4 5 6 7 8...100 下一页

1000+ results for bfloat16

1000+ results
for bfloat16