-
I added CUDA Graphs: https://developer.nvidia.com/blog/cuda-graphs/
So you can now add the following to any cfg-file:
```
[net]
use_cuda_graph = 1
```
and Detection will be +20% faster on GPU (**starting from…
-
Running the default example doesn't work:
```text
Namespace(verbose=True, batch_size_for_cuda_graph=1, chat_template='', model='.\\example-models\\phi2-int4-directml')
Loading model...
Model loa…
```
-
### 🚀 The feature, motivation and pitch
I noticed that inductor has registered the PrivateUse1 backend, but the cudagraph implementation is hard-coded for CUDA, e.g. https://github.com/…
-
### Describe the issue
I have built onnxruntime-gpu 1.4.0 following . Both `import onnxruntime` and `onnxruntime.get_device()` behave normally, and `onnxruntime.InferenceSession()` seems ok…
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [X] 2. The bug has not been fixed in the latest version.
- [X] 3. Please note that if the bug-related iss…
-
### 🐛 Describe the bug
When I `torch.compile` code that takes an argument of type `nn.Module`, a recompilation is triggered on every call with a different instance. I expected it to recompile on…
-
Hi there! Thank you for your amazing work on implementing faster components for transformer-based models! I've noticed that you launch multiple GPU kernels in an encoder or decoder. Have you ever trie…
-
CUDA graphs are designed to reduce launch overhead in exactly the scenario GeNN uses:
> Loop over timesteps
> …
> shortKernel1
> shortKernel2
> …
> shortKernelN
See https://devblogs…
-
### 🚀 The feature, motivation and pitch
vLLM only enables CUDA graphs for decoding-only batches (mainly because it did not see a big perf improvement when the batched token length is > 256). This behavior is pres…
-
### Question
I want to add a graph to the observation space by:
```python
# Create NetworkX graph
G = nx.Graph()
# Add nodes (C-alpha atoms) with 320-dimensional zero embeddings
for _, ro…
```
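The snippet above is cut off, but a minimal self-contained sketch of the idea (a NetworkX graph whose nodes carry fixed-size zero embeddings) could look like the following; the residue count and chain topology here are hypothetical stand-ins, not taken from the original code.

```python
import networkx as nx
import numpy as np

# Each node (e.g. a C-alpha atom) carries a fixed-size zero embedding.
EMBED_DIM = 320
NUM_RESIDUES = 5  # hypothetical; stands in for the rows iterated over above

G = nx.Graph()
for i in range(NUM_RESIDUES):
    G.add_node(i, embedding=np.zeros(EMBED_DIM, dtype=np.float32))

# Connect consecutive residues along the chain (a simple hypothetical topology).
for i in range(NUM_RESIDUES - 1):
    G.add_edge(i, i + 1)

# Stack node features into one array, e.g. to hand to an observation space.
features = np.stack([G.nodes[n]["embedding"] for n in G.nodes])
print(features.shape)  # (5, 320)
```

From here, `features` (together with an edge list derived from `G.edges`) is the kind of flattened form a gym-style observation space typically expects.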