graph-kernels Search Results

1000+ results
for graph-kernels

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

rapidsai/cuml #6112

[QST] How to Run Multiple Concurrent Tasks on a GPU in Dask…

I want to test the execution of multiple concurrent tasks on the GPU in Dask cuML. I'm using K-means (code below) and I'm changing the chunk size so that I can create multiple tasks for the fit method…

mnlcarv updated 3 days ago
2
pytorch/pytorch #136271

[triton x pt2] Dynamo should trace data_ptr accesses

internal xref: https://fb.workplace.com/groups/1075192433118967/permalink/1505159356788937/ The motivation is that people commonly pack data ptrs into a tensor and then pass the tensor to a triton …

zou3519 updated 3 weeks ago
3
pytorch/pytorch #114048

Undocumented CUDA graphs requirement that kernels must use s…

### 🐛 Describe the bug I have a function that calls some custom cuda kernels interleaved with pytorch operations. When I try to capture the function with a cuda graph, the cuda kernels become no-ops.…

tsengalb99 updated 7 months ago
5
Dao-AILab/flash-attention #1164

Correctness of `flash_attn_varlen_func` kernel with cuda gra…

Thanks for the great repo! We are testing the correctness of `flash_attn_varlen_func` when enabling the cuda graph. This is the test we use. https://github.com/vllm-project/vllm/blob/66e832be41cd3f…

LiuXiaoxuanPKU updated 2 months ago
6
triton-lang/triton #4263

Getting started with Triton for custom hardware backend

I am looking for some pointers to get started with leveraging Triton to generate kernels for a custom hardware backend. I see there have been efforts made that support lowering PyTorch to Triton Ke…

gauravjain14 updated 3 months ago
1
danielegrattarola/spektral #455

RuntimeError: Exception encountered when calling ARMAConv.ca…

Hi, trying to learn spektral here. I can't seem to use ARMAConv: def create_model(n_nodes, n_node_features, model_layers): node_features_input = Input(shape=(n_node_features,), dtype=tf.float3…

donn-liew updated 1 week ago
1
ROCm/omnitrace #335

Enabling Detailed Profiling of Graph Nodes in OmniTrace

Hi, I am currently working on profiling [VLLM](https://github.com/ROCm/vllm/tree/main) and I observed that the tool captures the execution of graph kernels at a high level but does not provide detail…

OmarSayedMostafa updated 1 week ago
2
pytorch/pytorch #120423

Inefficient triton kernels generated by inductor

### 🐛 Describe the bug I gathered all the 10K triton kernels generated by inductor using stack of PR ( https://github.com/pytorch/pytorch/pull/120048 ). After deduping same kernels used by different …

shunting314 updated 1 month ago
7
TheoBourdais/ComputationalHypergraphDiscovery #3

Crashed on quick start example

```python import ComputationalHypergraphDiscovery as CHD import pandas as pd df=pd.read_csv('https://raw.githubusercontent.com/TheoBourdais/ComputationalHypergraphDiscovery/main/examples/SachsData.…

NicolasRouquette updated 7 months ago
2
ty2/ideapad-laptop-tb2024g6plus #5

Problems with newer kernels >= 6.10

Laptop: ThinkBook 16 G6+ IMH (Intel-based) OS: Ubuntu 24.04 LTS/Gnome With some workarounds was able to build this kernel module for 6.8.x kernel(s). And it works pretty good. But with newer ke…

mikamiel updated 1 month ago
4

上一页 1...1 2 3 4 5 6 7...100 下一页

1000+ results for graph-kernels

1000+ results
for graph-kernels