inference-server Search Results

1000+ results
for inference-server

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

triton-inference-server/tensorrtllm_backend #86

Config issue whilst spinning up the server with Falcon

I've followed a mixture of the tutorial for building Falcon [here](https://github.com/NVIDIA/TensorRT-LLM/tree/release/0.5.0/examples/falcon) and for spinning up on the triton inference server [here](…

harryjulian updated 11 months ago
4
triton-inference-server/tensorrtllm_backend #57

Param "stop_words" not respected in v2/models/ensemble/gener…

Hi, it doesn't seem like "stop_words" is respected in the generate endpoint. I'm getting the same output with and without this field ``` curl -X POST localhost:8000/v2/models/ensemble/generate …

yunfeng-scale updated 10 months ago
6
numerique-gouv/meet #62

R&D: Deploy Whisper

# Deploy a Speech-to-Text model These works are focused on Whisper. ## What's whisper? Whisper is a Transformer-based model developed by OpenAI, specializing in Speech-to-Text (STT) tasks, also kno…

lebaudantoine updated 2 weeks ago
3
ChuRuaNh0/FastSam_Awsome_TensorRT #2

Hello, can the trace also give an example of tensorrt deploy…

Hello, can the trace also give an example of tensorrt deployment?

orderer0001 updated 1 year ago
3
RidgeRun/gst-inference #211

Documentation fails to build

``` Making all in plugins DOC Introspecting gobjects (gst-plugin-scanner:9617): GStreamer-WARNING **: 16:27:49.673: Failed to load plugin '../../ext/r2inference/.libs/libgstinference.so': dlo…

michaelgruner updated 5 years ago
1
PaddlePaddle/Paddle.js #513

RuntimeError: (NotFound) Operator (one_hot) is not registere…

# 代码 import gradio as gr from paddlenlp import Taskflow import numpy as np from PIL import Image import uuid # 初始化文档智能任务模型 docprompt = Taskflow("document_intelligence") # 定义模型推理函数 def m…

Idaydayup updated 3 months ago
5
continuedev/continue #1111

Adding Local LLM, LM Studio not working. Doesn't show up in …

### Before submitting your bug report - [X] I believe this is a bug. I'll try to join the [Continue Discord](https://discord.gg/NWtdYexhMs) for questions - [X] I'm not able to find an [open issue]…

NiceShyGuy updated 2 months ago
2
h2oai/h2ogpt #87

NVIDIA Triton inference support

https://github.com/triton-inference-server/ - [x] Build Triton Docker image with support for FasterTransformer backend for Fusion etc. - [x] convert h2oGPT models to format that Triton understands h…

arnocandel updated 1 year ago
1
rescript-lang/rescript-vscode #1034

Stack overflow when showing a type involving recursive polym…

Thanks for all the work on ReScript! I'm running into an issue with the VS Code extension freezing when trying to view the type of `fold` in the following code (simplified from an actual language A…

interphx updated 4 weeks ago
1
HopkinsIDD/flepiMoP #90

Major inference rewrite

Many of the current issues concern inference (#87 #86 #84 #85, ...) At the risk of delaying the solving, wanted to start some discussion about rewriting inference with the current gempyor object st…

jcblemai updated 11 months ago
1

上一页 1...83 84 85 86 87 88 89...100 下一页

1000+ results for inference-server

1000+ results
for inference-server