triton Search Results - Githubissues

1000+ results
for triton


jax-ml/jax #15900

triton_autotuner: Rounding modifier required for instruction…

### Description Getting the following error when trying to run code on an A100 80GB Google Cloud Debian Deep Learning image ([c0-deeplearning-common-cu113-v20230501-debian-10](https://console.cloud.…

KeremTurgutlu updated 1 year ago
11
triton-lang/triton #1693

@triton.jit cannot be built using pip install -e .

os: Ubuntu 22.04 pytorch: 2.1.0 nightly with cuda 12.1 miniconda-3.10 (latest) When using `pip install -e .` as documented to compile/install triton 2.1.0-dev[head], @triton.jit doesn't get b…

Qubitium updated 6 months ago
12
pytorch/pytorch #140080

DISABLED test_comprehensive_nn_functional_batch_norm_cuda_fl…

Platforms: inductor This test was disabled because it is failing in CI. See [recent examples](https://hud.pytorch.org/flakytest?name=test_comprehensive_nn_functional_batch_norm_cuda_float64&suite=Tes…

pytorch-bot[bot] updated 1 hour ago
1
tritonmc/Triton #217

My feature request wishlist for Triton

### Describe the Feature - Being able to rename collections - Making translation things easier by adding a button like "skip to next untranslated item" so that you don't have to navigate in menus li…

PalmTamino updated 1 year ago
6
triton-inference-server/server #7667

Error: ensemble of tensorrt + python_be + tensorrt is suppor…

My setup is: 1. jetson orin 32GB 2. JetPack 6.0 3. Triton 2.40 (NGC Container 23.11) 4. Cuda 12.2, TensorRT 8.6.2 5. Python Backend API 1.16 **`input_0: try to use CUDA copy while GPU is no…

olivetom updated 3 weeks ago
12
alexzhang13/flashattention2-custom-mask #12

Questions about bf16 support and correctness check

Dear Alex. Thanks for this great repo. The flash attention community really needs this feature. I'm trying to integrate this repo in my own project, but encounter two issues: - torch.bfloat16 is n…

xiabingquan updated 1 month ago
1
triton-inference-server/tensorrtllm_backend #356

How to deploy qwen-vl using tensorrtllm_backend?

I would like to deploy qwen-vl using Triton. Do you have any example repositories that are compatible with qwen-vl?

mouweng updated 3 months ago
3
triton-inference-server/fastertransformer_backend #97

triton server crashed after reload the same model

### Description ```shell Host: linux amd64 GPU: RTX 3060 container version:22.12 GPT model converted from megatron (model files and configs are from gpt guide) dockerfile: ---- ARG TRITON_SE…

heiruwu updated 1 year ago
2
FunAudioLLM/CosyVoice #517

Training LLM to the end of an epoch, it seems that the gpu c…

Hello, thank you for your open source. When I train on my own dataset, an error message will be reported at the end of 1 epoch training. The error message is as follows: 2024-10-18 20:47:35,180 D…

CriDora updated 2 weeks ago
3
triton-inference-server/server #7337

Triton server crash when running a large model with an ONNX/…

**Description** I encounter a crash when I am using big model with ONNX backend on CPU. The problem seems to be related to this closed ticket: https://github.com/triton-inference-server/server/issu…

LucasAudebert updated 4 months ago
1

