gpu-algorithm Search Results

pytorch/pytorch #132940

Enabling GPU mixed precision algorithms

### 🚀 The feature, motivation and pitch PyTorch does not seem to provide wrappers for mixed-precision algorithms in, e.g., MAGMA (dshpov, shportf) and cuSolver (https://docs.nvidia.com/cuda/cusolver/inde…

jvitordeoliveira96 updated 2 months ago

LMCache/LMCache #139

KV cache loading shouldn't block other requests

We should find a scheduling algorithm to reduce GPU idle time.

YaoJiayi updated 2 weeks ago

pmodels/mpich #7024

Allreduce algorithm, performance and codepath issue on ZE gp…

1. We observe a sudden abnormal increase (2-3x) in the collective communications with all reduce from 1MB and beyond with GPUs. You can reproduce this issue by measuring the time taken to complete 10…

kaushikvelusamy updated 5 days ago

cvxgrp/scs #77

GPU algorithm stability?

Is there any known solver instability when solving with GPU? E.g. I get the following behavior ``` ---------------------------------------------------------------------------- SCS v1.2.6 - Split…

tachim updated 7 years ago

getindata/kedro-vertexai #178

Support for `Dataproc` serverless components

As I understand, through a tag based system, the plugin can assign different compute targets for each node as follows: ```yaml # excerpt from vertexai.yml # see https://kedro-vertexai.readthedo…

abhi8893 updated 1 week ago

intelligent-machine-learning/dlrover #470

Develop algorithms for auto-tuning both GPU memory usage and…

Making FSDP auto-tune. There are many knobs that users can tune today with FSDP for both scaling and performance.

workingloong updated 9 hours ago

microsoft/Tutel #251

How expert parameters are distributed in the cluster when us…

Sorry, I have some questions to ask: 1. If I set num_local_experts = 2, does it mean that every GPU has two experts, and that both experts' parameters reside on that one GPU? 2. If I set num_local_experts = -2, …

luuck updated 14 hours ago

Tractables/ChowLiuTrees.jl #11

MST algorithm on GPU

MhDang updated 2 years ago

AlgoGenesis/C #1412

Layered Permutation and Mixing Hash (LPMH) Algorithm

# Bucket Hashing: Layered Permutation and Mixing Hash (LPMH) ## Overview **Layered Permutation and Mixing Hash** The Layered Permutation and Mixing Hash (LPMH) algorithm is designed to offer enhanc…

sivanandhinisellamuthu updated 1 day ago

imoneoi/multipack_sampler #4

Algorithm does not work for n=1

Hi author, Thank you for the great work. The algorithm runs very fast! However, I think the current algorithm does not consider the corner case of just a single GPU (n=1), and in this case, the a…

LingxiaoShawn updated 1 week ago

1000+ results for gpu-algorithm
