triton Search Results - Githubissues

1000+ results
for triton

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

triton-lang/triton #1191

How to install?

# pip3 install triton Defaulting to user installation because normal site-packages is not writeable ERROR: Could not find a version that satisfies the requirement triton (from versions: none) ERROR…

Stefar77 updated 1 year ago
4
triton-lang/triton #2099

AttributeError: module 'triton' has no attribute 'jit'

Hi friends: I installed triton from the source. But the triton compiler generates an error which is "AttributeError: module 'triton' has no attribute 'jit'". Someone knows how to solve this? Than…

tiandiao123 updated 11 months ago
2
microsoft/BitBLAS #90

Speedup problem with GPTQModel

Hi I test bitblas models with the [https://github.com/ModelCloud/GPTQModel](https://github.com/ModelCloud/GPTQModel) repo. I found that the output is correct. However, BitBLAS obtains similar to…

ChenMnZ updated 4 days ago
10
triton-lang/triton #2218

Segfault in TTIR when doing convert s8->f32 + dot

We created the following TTIR, which was previously working for us. However, it is now segfaulting at HEAD on main. The TTIR is essentially loading 2 parameters (one s8, one f32), convert the s8 to…

karupayun updated 1 year ago
1
triton-lang/triton #2898

[Feature Proposal] Adding triton-lsp-server for viewing

[MLIR LSP server](https://mlir.llvm.org/docs/Tools/MLIRLSP/) is a tool for IDE to understand `.mlir` files of various dialects. By integrating with `mlir-lsp` related tools, we can make IDE aware of t…

youkaichao updated 8 months ago
16
triton-inference-server/server #4772

Python Backend to support GPU instance

**Is your feature request related to a problem? Please describe.** A clear and concise description of what the problem is. Ex. I'm always frustrated when [...] I am using TRTIS on jetson Orin versio…

MhdKAT updated 13 hours ago
9
cloneofsimo/minRF #1

Support better kernel fusion for MMDiT architecture

either torch.compile / triton, forward / backward operations got too much activations that are probably bottlenecking training. For some reason, i got about 30% speedup at 1B scale but does not seem …

cloneofsimo updated 4 months ago
2
NVIDIA/TensorRT-LLM #1884

enc_dec: prompt_embedding_table not passed to encoder model

### System Info Tensorrt-LLM commit: 2a115dae84f13daaa54727534daa837c534eceb4 TensorRT-LLM version: 0.11.0.dev2024061800 ### Who can help? _No response_ ### Information - [X] The official exam…

thefacetakt updated 1 month ago
3
torvalds-dev/llama_index #33

[Feature Request]: Nvidia Triton Tensor RT LLM Integrations

### Feature Description I would like to add support for Nvidia Triton TensorRT LLMs in llama index. There is currently support for several other LLM endpoints and Nvidia has several interesting offer…

torvalds-dev updated 9 months ago
3
pytorch/pytorch #132865

aotautograd CSE interferes with Inductor reinplacing

~~Here's a theoretical example, but note that this doesn't actually repro (fx CSE doesn't seem to work in this case?). I'm seeing this with a much larger model and will try to get a real repro soon.~~…

zou3519 updated 2 weeks ago
3

上一页 1...94 95 96 97 98 99 100...100 下一页

1000+ results for triton

1000+ results
for triton