-
With Julia 1.11 coming up, we will have native support for BFloat16 (https://github.com/JuliaLang/julia/pull/51470).
Metal also supports BFloat16 onwards from [Apple6 GPU architecture](https://develo…
-
**train with bfloat16**
Is there a plan to support bfloat16 training? @maxhgerlach
-
### 🐛 Describe the bug
Category | Name | Inductor vs. Eager [XPU] | Inductor vs. Eager [CUDA] | XPU vs. CUDA [Eager] | XPU vs. CUDA [Inductor]
-- | -- | -- | -- | -- | --
huggingface_amp_fp16_tra…
-
### 🐛 Describe the bug
Category | Model | Accuracy
-- | -- | --
timm_models_amp_bf16_training | botnet26t_256 | fail_accuracy
timm_models_amp_fp16_training | botnet26t_…
-
I tried to use QAT to quantize the Qwen2 1.5B model.
The error is raised from the function `training.load_from_full_model_state_dict(
model, model_state_dict, self._device, self._is_rank_zero, strict=T…
-
I found that v2.6.3's `flash_attn_varlen_func` runs faster than v2.7.0.post2's `flash_attn_varlen_func` on H100.
Code:
```python
import torch
from hopper.flash_attn_interface import flash_attn_func, flash…
```
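For reference, here is a minimal timing sketch of this kind of comparison. It is not the truncated script above: it assumes the stable `flash_attn` 2.x import path (the hopper/FA3 `flash_attn_interface` used in the issue may take slightly different arguments), and the shapes and sequence lengths are made up.

```python
import torch
from flash_attn import flash_attn_varlen_func

torch.manual_seed(0)
batch, seqlen, nheads, headdim = 8, 4096, 32, 128
total = batch * seqlen

q, k, v = (torch.randn(total, nheads, headdim, device="cuda", dtype=torch.bfloat16)
           for _ in range(3))
# Cumulative sequence lengths for the "varlen" (packed) layout.
cu_seqlens = torch.arange(0, (batch + 1) * seqlen, seqlen,
                          device="cuda", dtype=torch.int32)

def bench(iters=50):
    # Warm up, then time with CUDA events so the kernels are measured
    # without host-side launch noise.
    for _ in range(10):
        flash_attn_varlen_func(q, k, v, cu_seqlens, cu_seqlens, seqlen, seqlen, causal=True)
    start = torch.cuda.Event(enable_timing=True)
    end = torch.cuda.Event(enable_timing=True)
    start.record()
    for _ in range(iters):
        flash_attn_varlen_func(q, k, v, cu_seqlens, cu_seqlens, seqlen, seqlen, causal=True)
    end.record()
    torch.cuda.synchronize()
    return start.elapsed_time(end) / iters  # ms per call

print(f"{bench():.3f} ms/iter")  # run once per installed flash-attn version
```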
-
### Background and motivation
The bfloat16 type provides the same number range as the 32-bit IEEE 754 single-precision floating point type, but with reduced precision (the significand shrinks from 24 bits to 8 bits). This is…
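To make the trade-off concrete, here is a small illustration (using PyTorch's dtype metadata purely for demonstration, an assumption on my part; the proposal itself is not tied to any particular library) comparing bfloat16's range and precision with float32 and float16:

```python
import torch

# Range: bfloat16 keeps float32's 8 exponent bits, so its maximum is close
# to float32's, while float16 tops out at 65504.
print(torch.finfo(torch.float32).max)   # ~3.4028e+38
print(torch.finfo(torch.bfloat16).max)  # ~3.3895e+38
print(torch.finfo(torch.float16).max)   # 65504.0

# Precision: machine epsilon reflects the 8-bit significand (2**-7)
# versus float32's 24-bit significand (2**-23).
print(torch.finfo(torch.float32).eps)   # ~1.19e-07
print(torch.finfo(torch.bfloat16).eps)  # 0.0078125

# One consequence: integers above 256 are no longer exactly representable.
print(torch.tensor(257.0, dtype=torch.bfloat16))  # tensor(256., dtype=torch.bfloat16)
```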
-
Thank you for making this very useful and well-tested library! Are you planning to add support for the bfloat16 format, which is used in the ML field? It has different bit widths for mantissa and exponent, bu…
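To illustrate what that split looks like (1 sign bit, 8 exponent bits, 7 explicit mantissa bits), here is a hedged sketch in plain Python of the usual float32 -> bfloat16 conversion. It is not this library's API, just the standard truncate-and-round trick; NaN handling is omitted for brevity.

```python
import struct

def float32_to_bfloat16_bits(x: float) -> int:
    # bfloat16 keeps float32's sign and 8 exponent bits but drops the low
    # 16 mantissa bits, so conversion is a rounding of the float32 bit pattern.
    bits = struct.unpack("<I", struct.pack("<f", x))[0]
    rounding_bias = 0x7FFF + ((bits >> 16) & 1)     # round-to-nearest-even
    return ((bits + rounding_bias) >> 16) & 0xFFFF  # NaN not special-cased here

def bfloat16_bits_to_float32(b: int) -> float:
    # Widening back to float32 is exact: just append 16 zero bits.
    return struct.unpack("<f", struct.pack("<I", (b & 0xFFFF) << 16))[0]

print(bfloat16_bits_to_float32(float32_to_bfloat16_bits(3.14159265)))  # 3.140625
```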
-
I'm encountering an issue with the Mochi VAE Decode Spatial Tiling node when running it on an Apple M1 Max.
![image](https://github.com/user-attachments/assets/7efd22cf-33c0-4bc6-b12f-2de9fb4b8f4b)
…
-
### 🐛 Describe the bug
Dear all,
We seem to have found a bug in nn.Linear forwarding; here is a minimal example:
```python
# import
import torch
import time
# Set input size, output size, an…
```