fused Search Results - Githubissues

1000+ results
for fused

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

LuxDL/Lux.jl #1013

Integration of `oneDNN` for CPU operations

- [x] Add oneDNN binaries to Yggdrasil -- https://github.com/JuliaPackaging/Yggdrasil/pull/9345 - [ ] how expensive is it to construct the internal memory type for the operations? - [ ] create a `on…

avik-pal updated 2 weeks ago
1
NVIDIA/Fuser #2904

testValidate uses half tolerances for fp32 accumulation.

This can lead to false negatives because the threshold is overly relaxed. ```diff diff --git a/tests/cpp/test_gpu_fused_reduction.cpp b/tests/cpp/test_gpu_fused_reduction.cpp index e67875f4..b3923d6…

wujingyue updated 4 weeks ago
1
NVIDIA/TensorRT #3243

[question] Myelin: attention fusion and FlashAttention

Hi! When attention op gets fused in a single op with Myelin, it's not written in trex-tooltip if it's using FlashAttention / proper fusion or not (and if it's using quantization under the hood, especi…

vadimkantorov updated 9 hours ago
15
xrsrke/pipegoose #13

Fused Optimizer

Since our DistributedOptimizer takes another optimizer and turns it into ZeRO-1, can we make it do a fused optimizer like this? It should take an optimizer and turn it into a fused ZeRO-1 in a generic…

xrsrke updated 1 year ago
8
pytorch/pytorch #139002

fatal error C1083: 无法打开包括文件: “cstddef”: No such file or dire…

### 🐛 Describe the bug https://github.com/microsoft/DeepSpeed/issues/6673 try install deepspeed . on torch 2.5.0-cuda then running build_ext ```error D:\my\env\python3.10.10\lib\site-packages…

xiezhipeng-git updated 4 weeks ago
2
villoren/KalmanLocationManager #6

Fused location

Great work you have! Could it have support for fused location ?

whitecloud-sas updated 5 years ago
4
FUSED-Wind/fusedwind #18

FUSED-models

The idea is to create a generic assembly, or a special type of assembly / component, that can do model aggregation. Several models that have similar I/Os for similar purpose would be able to be run in…

rethore updated 7 years ago
1
GTNewHorizons/GT-New-Horizons-Modpack #17831

Increase flying speed of the fused voidwalker boots (NanoSui…

### Your GTNH Discord Username Alchelio ### Your Pack Version 2.6.1 ### Your Proposal QVoiwalkers and NVoidwalkers should increase the flight speed as well, currently you fall like a rock if you …

Nici660 updated 2 weeks ago
4
vllm-project/vllm #10313

[Bug]: FusedMoE kernel performance depends on input prompt l…

### Your current environment The output of `python collect_env.py` ```text PyTorch version: 2.4.0+cu121 Is debug build: False CUDA used to build PyTorch: 12.1 ROCM used to build PyTorch: N/A…

taegeonum updated 15 hours ago
4
ROCm/MIOpen #1213

Dedicated primitive (e.g. `Fused`) for fused convolutions?

https://github.com/ROCmSoftwarePlatform/MIOpen/blob/4e61a3ebdfe2b07b4d331cc46832eb4a6b49941c/src/solver.cpp#L264 Can we introduce some dedicated primitive (e.g. `Fused`) for fused convolutions and …

atamazov updated 6 months ago
2

上一页 1...7 8 9 10 11 12 13...100 下一页

1000+ results for fused

1000+ results
for fused