graph-kernels Search Results

ysig/GraKeL #111

RandomWalk kernel incompatible with SciPy version >=1.14.0

**Describe the bug** The arguments for the function `scipy.sparse.linalg.cg` changed. `tol` was deprecated and got replaced by `rtol` in version 1.14.0 [[1](https://docs.scipy.org/doc/scipy-1.13.1/re…

klausweinbauer updated 3 days ago

vllm-project/vllm #9417

[Bug]: Regression for AWQ marlin kernels from v0.6.2 to …

### Your current environment First of all: fantastic project :-) Thank you for everything. I would like to fix this bug. But I just do not have the capacity now. So I just thought I would try to m…

joennlae updated 3 days ago

intel/llvm #2053

[SYCL] Unnecessary read_write dependencies when multiple dev…

I have some code that launches multiple kernels and distributes them on multiple queues which are for different CUDA devices. When only 1 gpu is used, we get the following dependency graph: ![dep_gra…

mfbalin updated 3 weeks ago

SNU-ARC/any-precision-llm #7

No real speedup from any-precision-llm kernels

Hello, Similarly to #3, I've tried reproducing the `demo.py` benchmark on an H100 and an A6000 and I'm also seeing no speedup on these platforms at lower precisions. It was mentioned this is du…

pgimenes updated 3 weeks ago

ggerganov/llama.cpp #4085

metal : compile-time kernel args and params

I was just thinking about this idea, so writing it down for future research. We should be able to fairly easy generate model-specific Metal code that has hardcoded kernels for every single node in …

ggerganov updated 3 weeks ago

eloquentarduino/EloquentTinyML #75

Linker Errors compiling for ESP32

Hi! I am getting a bunch of linker errors, appearing to be circular imports. I am using TFLM_ESP32 v2.0 and EloquentTinyML 3.0.1 Thanks in advance. Linking everything together... /home/alvaro…

AlverGant updated 1 month ago

vllm-project/vllm #6378

[RFC]: A Graph Optimization System in vLLM using torch.compi…

### Motivation. At a high level, we at Neural Magic are writing a custom compiler for Torch Dynamo to define a system within vLLM where we can write graph transformations. The main goal is a separa…

bnellnm updated 1 month ago

hikettei/Caten #145

Plans for rewriting Caten/ajit

## Current issue - [ ] Infeasible to merge multiple views - [ ] Cannot support multiple views (e.g. [A, B] -> View -> Shape cannot be inferenced) - [ ] Parallelize Attention QKV Projection …

hikettei updated 4 days ago

ml-explore/mlx #1426

[DEPRECATION] MacOS 15 SDK marks some functions used by MLX …

```bash # env CMAKE_BUILD_PARALLEL_LEVEL="" pip install . -v ``` Includes in the output: ``` /Users/user/Documents/AI/mlx/mlx/mlx/backend/accelerate/matmul.cpp:109:9: warning: 'BNNSLayerParame…

jrp2014 updated 3 weeks ago

NVIDIA/TransformerEngine #1241

How about the torch.compile in TransformerEngine ?

In PyTorch, we know that Torch.Compile will bring us a lot of benefits, and the TransformerEngine also brings performance improvements through strategies such as Transformer fusion optimization, so do…

south-ocean updated 4 days ago

1000+ results for graph-kernels

1000+ results
for graph-kernels