-
Fuse some popular functions and automatically replace modules in an existing 🤗 transformers model with their corresponding fused modules
**APIs**
```
from pipegoose.nn import fusion
# and ot…
```
-
### 🚀 The feature, motivation and pitch
**Context**:
While using [float8 training](https://fburl.com/4mblb81i), the operators of `fp8 = cast_to_fp8(input_tensor); fp8_t = fp8.t().contiguous().t()`…
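The layout effect of the `t().contiguous().t()` pattern can be illustrated with NumPy (an analogy only; the snippet above operates on PyTorch fp8 tensors): transposing, materializing contiguously, and transposing back leaves the values unchanged but makes the transpose contiguous, i.e. the result is column-major, which is the layout GEMM libraries often require.

```python
import numpy as np

# Analogy to PyTorch's `fp8.t().contiguous().t()`: same values,
# but the memory layout becomes column-major (Fortran order),
# so the *transpose* of the result is contiguous.
x = np.arange(6, dtype=np.float32).reshape(2, 3)  # row-major (C order)
x_col_major = np.ascontiguousarray(x.T).T         # same values, column-major

assert np.array_equal(x, x_col_major)
print(x.flags["C_CONTIGUOUS"], x_col_major.flags["F_CONTIGUOUS"])  # True True
```

The extra pass over memory that this copy implies is exactly the overhead the issue is concerned with.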
-
For transformer models with small to medium-sized GEMMs, the advantages of using fp8 cuBLASLt GEMMs may be overshadowed by the additional computational overhead introduced by memory loads in the quant…
-
## 🚀 Feature
CuDNN provides flexible support for performant GEMM/conv with fp8 quantization. If Thunder introduces fp8 casts in its traces, it can benefit from cuDNN fusions.
### Motivation
Today, thu…
-
### Motivation and description
Wondering what kind of speedup can be achieved by writing GPU kernels for optimizers.
Take a look at @pxl-th's implementation of Adam below
https://github.com/Jul…
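For reference, the per-parameter math such an optimizer kernel performs is the standard Adam update (Kingma & Ba). Below is a minimal NumPy sketch of one step, not the linked Julia implementation; a fused GPU kernel would do the same element-wise work in a single pass over the parameters instead of several array-level operations.

```python
import numpy as np

def adam_step(theta, grad, m, v, t, lr=1e-3, beta1=0.9, beta2=0.999, eps=1e-8):
    """One standard Adam update; `t` is the 1-based step count."""
    m = beta1 * m + (1 - beta1) * grad         # first-moment EMA
    v = beta2 * v + (1 - beta2) * grad * grad  # second-moment EMA
    m_hat = m / (1 - beta1 ** t)               # bias correction
    v_hat = v / (1 - beta2 ** t)
    theta = theta - lr * m_hat / (np.sqrt(v_hat) + eps)
    return theta, m, v

theta = np.zeros(4, dtype=np.float32)
m, v = np.zeros_like(theta), np.zeros_like(theta)
grad = np.ones_like(theta)
theta, m, v = adam_step(theta, grad, m, v, t=1)  # theta ≈ -1e-3 everywhere
```

Each step reads and writes `theta`, `m`, and `v`; a fused kernel avoids launching one kernel per intermediate array, which is where the speedup would come from.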
-
This issue is for tracking the performance of the prognostic implicit EDMF implementation.
```[tasklist]
### Tasks
- [ ] Make a reproducer for the kernels discussed here: https://github.com/CliMA/Clim…
-
## CVE-2020-12652 - Medium Severity Vulnerability
Vulnerable Library - linux-4.19.30
Apache Software Foundation (ASF)
Library home page: https://mirrors.edge.kernel.org/pub/linux/kernel/v4.x/?…
-
## CVE-2020-12652 - Medium Severity Vulnerability
Vulnerable Library - linux-4.19.313
The Linux Kernel
Library home page: https://mirrors.edge.kernel.org/pub/linux/kernel/v4.x/?wsslib=linux
F…
-
## Introduction
I am an engineer currently working on 3D model parallelism for transformers. When the tensor model parallelism (https://github.com/huggingface/transformers/pull/13726) is done, I am g…
-
FP16 and use_amp support
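The core mechanism behind fp16/AMP training is dynamic loss scaling: the loss is multiplied by a scale factor before backward so small gradients don't underflow in fp16, gradients are unscaled before the optimizer step, overflowing steps are skipped, and the scale adapts over time. A minimal pure-Python sketch of that mechanism (a hypothetical helper, not the `torch.cuda.amp.GradScaler` API):

```python
import math

class LossScaler:
    """Sketch of dynamic loss scaling for fp16 training: grow the scale
    while training is stable, back off and skip the step on overflow."""

    def __init__(self, init_scale=2.0 ** 16, growth_factor=2.0,
                 backoff_factor=0.5, growth_interval=2000):
        self.scale = init_scale
        self.growth_factor = growth_factor
        self.backoff_factor = backoff_factor
        self.growth_interval = growth_interval
        self._good_steps = 0

    def scale_loss(self, loss):
        # Called before backward(): keeps fp16 gradients out of underflow.
        return loss * self.scale

    def step(self, grads):
        """Unscale grads; return them if finite, else None (skip the step)."""
        unscaled = [g / self.scale for g in grads]
        if any(math.isinf(g) or math.isnan(g) for g in unscaled):
            self.scale *= self.backoff_factor  # overflow: reduce the scale
            self._good_steps = 0
            return None
        self._good_steps += 1
        if self._good_steps % self.growth_interval == 0:
            self.scale *= self.growth_factor   # stable: try a larger scale
        return unscaled

scaler = LossScaler(init_scale=4.0)
scaled_loss = scaler.scale_loss(0.5)  # 2.0
grads = scaler.step([4.0, 8.0])       # [1.0, 2.0]
```

`use_amp` support would wrap the forward pass in an autocast region and route the backward/step through a scaler like the one sketched above.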