-
@npuichigo I am trying to use [Triton Inference Server with TensorRT-LLM backend](https://nvidia.github.io/TensorRT-LLM/quick-start-guide.html#deploy-with-triton-inference-server) with [openweb-ui](ht…
-
I completed the concurrency test with the **TensorRT + Triton Server** deployment, and throughput under concurrency roughly doubled compared to faster-whisper.
I am testing its accuracy, but the Chinese tr…
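A concurrency comparison like the one above can be approximated by timing the same batch of requests serially and in parallel. A minimal sketch follows; the `transcribe` stub is a placeholder (an assumption, not the actual client code) that would normally POST audio to the Triton/TensorRT endpoint, with a short sleep standing in for server-side latency:

```python
import time
from concurrent.futures import ThreadPoolExecutor

def transcribe(audio_id):
    """Stand-in for one transcription request.

    A real test would send audio to the Triton/TensorRT server;
    here a short sleep simulates the request round-trip.
    """
    time.sleep(0.05)
    return audio_id

def throughput(n_requests, workers):
    """Return requests/second for n_requests issued over `workers` threads."""
    start = time.perf_counter()
    with ThreadPoolExecutor(max_workers=workers) as pool:
        list(pool.map(transcribe, range(n_requests)))
    return n_requests / (time.perf_counter() - start)

serial = throughput(20, workers=1)
parallel = throughput(20, workers=4)
print(f"serial: {serial:.1f} req/s, parallel: {parallel:.1f} req/s")
```

With a real backend, the ratio between the two numbers shows how well the server overlaps concurrent requests, which is what the faster-whisper comparison is measuring.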
-
This is a PR on the upstream `openai/triton`: https://github.com/openai/triton/pull/2629
which uses a non-anonymous email-sending function to avoid being intercepted, e.g. a mail connection string: `smtp+…`
-
**Is your feature request related to a problem? Please describe.**
The vLLM backend works well and is easy to set up, compared to TensorRT, which had me pulling my hair out.
However it lacks the OpenAI co…
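For reference, an OpenAI-compatible serving interface boils down to accepting POSTs to `/v1/chat/completions` shaped like the payload below. The base URL and model id here are placeholders, not a real deployment:

```python
import json

BASE_URL = "http://localhost:8000"    # hypothetical server address
ENDPOINT = "/v1/chat/completions"     # OpenAI-compatible route

# Request body in the shape the OpenAI Chat Completions API defines.
payload = {
    "model": "my-local-model",        # placeholder model id
    "messages": [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": "Hello!"},
    ],
    "temperature": 0.7,
    "stream": False,
}

body = json.dumps(payload)
print(f"POST {BASE_URL}{ENDPOINT}\n{body}")
```

Clients such as the official `openai` SDK only need the server to honor this route and schema, which is why a compatibility layer on top of an existing backend is usually enough.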
-
I need to use locally deployed LLMs for evaluation within my current setup. While setting up LLM monitoring with Phoenix, I need to run evaluations against the traces, but I am only able to find [evaluation llm…
-
I have converted Mixtral to TensorRT and I am trying to use your repository to integrate with OpenAI.
I'm using the template history_template_llama3.liquid. When I run your example code for interactin…
-
I don't know what's going on; it's reporting this kind of error. Everything was normal before training, then this problem suddenly occurred. Can you help me take a look?
2024-04-20 08:27:16.276530: Epoch 600…
-
# Enhancement
Use the [Triton](https://triton-lang.org/main/index.html) compiler from OpenAI to accelerate model training.
-
Hello, I want to deploy a quantized Llama-3-8B model using tritonserver. I followed the steps below:
1. Create a container from the `nvcr.io/nvidia/tritonserver:24.06-trtllm-python-py3` base image.
3.…
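For context, a Triton model repository for a TensorRT-LLM engine generally follows a layout like the sketch below. Directory names and config fields here are illustrative; the exact `config.pbtxt` parameters depend on the backend version being used:

```
model_repository/
└── tensorrt_llm/
    ├── config.pbtxt
    └── 1/
        └── (compiled TensorRT engine files)
```

A minimal `config.pbtxt` header for that model directory might look like:

```
name: "tensorrt_llm"
backend: "tensorrtllm"
max_batch_size: 8
```

The server is then pointed at the repository root with `tritonserver --model-repository=/path/to/model_repository`.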
-
### The high level motivation
Some real world PyTorch benchmarks that we would like to run are at: https://github.com/pytorch/benchmark/tree/64409d5704b6136c6cb28071ff8eba61751b1b02/torchbenchmark/…