triton Search Results - Githubissues

1000+ results
for triton

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

facebookresearch/generative-recommenders #56

Triton is running too slow?

Compared to the same structure(the qkv attention) I implemented with TensorFlow, triton runs 10 to 20 times slower. With the help of nsight system, I found that cudaMemcpySync takes off much time whil…

bzxc updated 1 month ago
1
linkedin/Liger-Kernel #179

Improve the efficiency of the RMSNorm aggregation

### 🚀 The feature, motivation and pitch Modify this line https://github.com/linkedin/Liger-Kernel/blob/main/src/liger_kernel/ops/rms_norm.py#L306, the sum in pytorch to partial aggregation in trito…

lancerts updated 6 days ago
5
JonathanSalwan/Triton #1353

MEMORY_ARRAY and sub-word symbolic reasoning

```py def test_symbolic_rw_in_array_mode(): code = { 0x1000: bytes.fromhex("FD030091"), # mov x29, sp 0x1004: bytes.fromhex("FF4300D1"), # sub sp, sp, #16 0x1008: by…

0x9047 updated 2 weeks ago
1
JonathanSalwan/Triton #1347

make -j3 capstone

Running into build error, anyone else getting this? ``` [ 2%] Building CXX object src/libtriton/CMakeFiles/triton.dir/arch/arm/aarch64/aarch64Cpu.cpp.o [ 2%] Building CXX object src/libtriton/C…

yorkyman updated 1 month ago
6
BBuf/flash-rwkv #1

Triton support

If you're planning to make this API somehow standardized it would be great to integrate Songlin Yang's excellent new Triton RWKV-6 implementation from FLA https://github.com/sustcsonglin/flash-linear…

SmerkyG updated 4 months ago
1
triton-inference-server/server #7472

Triton crashes with SIGSEGV (signal 11)

**Description** Triton receives SIGSEGV during handling the traffic. Last thing that it wrote out was `E0723 11:57:36.328641 1 infer_handler.h:187] ""[INTERNAL] Attempting to access current response …

JindrichD updated 1 week ago
4
state-spaces/mamba #386

Triton Error [CUDA]: device kernel image is invalid

Thanks for the wonderful work. When running Mamba2, I encountered the error "Triton Error [CUDA]: device kernel image is invalid". Should you be so kind as to provide some advice? My enviro…

rationalspark updated 1 day ago
8
triton-inference-server/model_analyzer #908

wrong with --triton-launch-mode=remote

### **Problem:** When using model-analyzer with --triton-launch-mode=remoted, I encounter connectivity issues. ### **Context:** I have successfully started Triton Inference Server on the same ser…

RheaRia updated 3 weeks ago
1
state-spaces/mamba #369

triton error while running Mamba2 with slow path

as #355 , I added "@torch.compile(options={"triton.cudagraphs": True}, fullgraph=True)" to "mamba_chunk_scan_combined" function in file "ssd_combined.py", and running failed with error: ``` Unsup…

Seeker98 updated 1 day ago
10
triton-lang/triton #4319

error: fp8e4nv data type is not supported on CUDA arch < 89

https://github.com/triton-lang/triton/blob/95623038c75463286aa5d4a44782ba7492cc1afa/python/triton/language/semantic.py#L761C1-L763C1 how to resolve this

yiyepiaoling0715 updated 1 week ago
2

上一页 1...4 5 6 7 8 9 10...100 下一页

1000+ results for triton

1000+ results
for triton