-
The code in the `llvm-target` branch is a fork of the OpenAI Triton code with modifications in several files. The structure of the project mirrors the structure of the AMD port. This work item objecti…
-
Hello,
I have pretrained a model with Hugging Face and attempted to deploy it using the TRTLLM-Triton Server method as documented [here](https://github.com/k2-fsa/sherpa/blob/master/triton/whisper/mod…
-
Hi,
When I tried the FP8 GEMM code in matmul.py, I kept the input "a" in float16 but cast it to FP8 just before the dot-product op by setting AB_DTYPE to tl.float8e4nv (link: https://github.com/…
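To see what that late cast does to the operands, here is a pure-Python sketch (not Triton code) that simulates rounding a float16-range value to the FP8 E4M3 format (4 exponent bits, 3 mantissa bits, max finite value 448), which is what `tl.float8e4nv` denotes. The function name and rounding details are illustrative assumptions, not part of the Triton API:

```python
import math

def quantize_e4m3(x: float) -> float:
    """Round x to the nearest FP8 E4M3-representable value (sketch).

    E4M3: 3 mantissa bits, minimum normal exponent -6, max finite 448.
    Values beyond the range are clamped to +-448 (no infinities in e4m3).
    """
    if x == 0.0:
        return 0.0
    sign = -1.0 if x < 0 else 1.0
    m, e = math.frexp(abs(x))        # abs(x) = m * 2**e, m in [0.5, 1)
    exp = e - 1                      # exponent with mantissa in [1, 2)
    if exp < -6:
        # subnormal range: fixed exponent -6, 3 mantissa bits -> step 2**-9
        step = 2.0 ** -9
    else:
        step = 2.0 ** (exp - 3)      # 3 mantissa bits of precision
    q = round(abs(x) / step) * step  # round to nearest representable
    return sign * min(q, 448.0)

# Example: 0.3 is not representable in e4m3; it rounds to 0.3125.
print(quantize_e4m3(0.3))
```

This makes the trade-off visible: casting "a" to FP8 right before `tl.dot` discards mantissa bits, so some accuracy loss relative to a float16 input is expected even before the dot product runs.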
-
Hello, we have measured FP8 GEMM performance using Triton on an NVIDIA H100 (500 W, 1980 MHz). We would like your help in understanding whether this performance is expected.
Since H100 FP8 o…
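One way to frame the question is to convert the measured runtime into achieved TFLOP/s and compare against the advertised peak. The sketch below uses the dense FP8 peak of roughly 1979 TFLOP/s from NVIDIA's H100 SXM datasheet; the shapes and the 1 ms runtime are hypothetical placeholders, not the numbers from this report:

```python
def achieved_tflops(M: int, N: int, K: int, runtime_ms: float) -> float:
    """TFLOP/s for an M x N x K GEMM: 2*M*N*K flops over the runtime."""
    flops = 2 * M * N * K
    return flops / (runtime_ms * 1e-3) / 1e12

# Dense FP8 peak for H100 SXM per the public datasheet (assumption).
PEAK_FP8_TFLOPS = 1979.0

# Hypothetical measurement: an 8192^3 GEMM finishing in 1 ms.
tf = achieved_tflops(8192, 8192, 8192, 1.0)
print(f"{tf:.1f} TFLOP/s = {tf / PEAK_FP8_TFLOPS:.1%} of peak")
```

Reporting the efficiency fraction rather than raw TFLOP/s makes it easier to judge whether a kernel is in the expected range for its shape.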
sryap updated 2 months ago
-
**Is your feature request related to a problem? Please describe.**
I'd like to be able to run vLLM emulating the OpenAI-compatible API, so that vLLM can serve as a drop-in replacement for ChatGPT.
**Describe…
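For a drop-in replacement, a client only needs to build the same request body the OpenAI Chat Completions endpoint accepts and POST it to the local server. The base URL and model name below are illustrative assumptions; the payload shape follows the OpenAI API:

```python
import json

# Hypothetical local endpoint; vLLM's OpenAI-compatible server would
# expose /v1/chat/completions under a base URL like this.
BASE_URL = "http://localhost:8000/v1"

def build_chat_request(model: str, user_msg: str) -> str:
    """Serialize an OpenAI-style chat completion request body."""
    payload = {
        "model": model,
        "messages": [{"role": "user", "content": user_msg}],
    }
    return json.dumps(payload)

body = build_chat_request("my-local-model", "Hello!")
# POST `body` to f"{BASE_URL}/chat/completions" with any HTTP client;
# existing OpenAI SDK clients work by pointing their base URL here.
print(body)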
-
hit56 updated 1 month ago
-
The README.md says: "Triton currently supports only Linux and WSL; Windows and macOS are not yet supported, please wait for future updates."
However, Windows and macOS can install it manually from [https://github.com/openai/triton](https://github.com/openai/triton):
## Install from source
```
git clone http…
```
-
I noticed that
> CSE and LICM don't work as expected with `exp` in the loop
is mentioned in `/python/triton/ops/flash_attention.py` (credits to Adam P. Goucher @apgoucher )
Can someone expla…
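As background on what LICM would do with `exp` in a loop, here is a pure-Python illustration (not the Triton compiler's actual transformation): rewriting `exp(x - m)` as `exp(x) * exp(-m)` lets the loop-invariant factor `exp(-m)` be hoisted out of the loop:

```python
import math

def sum_exp_naive(xs, m):
    # The subtraction of the loop-invariant m sits inside the loop, so
    # every iteration evaluates exp on a freshly shifted argument.
    return sum(math.exp(x - m) for x in xs)

def sum_exp_hoisted(xs, m):
    # Manual LICM: exp(x - m) == exp(x) * exp(-m), so the invariant
    # factor exp(-m) is computed once outside the loop.
    scale = math.exp(-m)
    return scale * sum(math.exp(x) for x in xs)

xs = [0.1, 0.5, 1.0]
print(sum_exp_naive(xs, 2.0), sum_exp_hoisted(xs, 2.0))
```

Note the rewrite is only algebraically equivalent: `exp(x)` without the `- m` shift can overflow for large `x` (the shift by the running maximum is exactly what keeps flash attention numerically stable), which is one plausible reason a compiler must be conservative about moving `exp` in such loops.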
-
### Your current environment
Hello,
when the Python Wheel is installed according to your documentation:
https://docs.vllm.ai/en/latest/getting_started/installation.html#install-with-pip
The imag…
ch9hn updated 2 months ago
-
A new v3 of the model was released on 2023-11-06; see the changelog:
https://github.com/openai/whisper/blob/main/CHANGELOG.md