inference-server Search Results

1000+ results
for inference-server

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

h2oai/h2ogpt #87

NVIDIA Triton inference support

https://github.com/triton-inference-server/ - [x] Build Triton Docker image with support for FasterTransformer backend for Fusion etc. - [x] convert h2oGPT models to format that Triton understands h…

arnocandel updated 1 year ago
1
rescript-lang/rescript-vscode #1034

Stack overflow when showing a type involving recursive polym…

Thanks for all the work on ReScript! I'm running into an issue with the VS Code extension freezing when trying to view the type of `fold` in the following code (simplified from an actual language A…

interphx updated 4 weeks ago
1
HopkinsIDD/flepiMoP #90

Major inference rewrite

Many of the current issues concern inference (#87 #86 #84 #85, ...) At the risk of delaying the solving, wanted to start some discussion about rewriting inference with the current gempyor object st…

jcblemai updated 11 months ago
1
microsoft/JARVIS #134

Is anyone build this demo successful on the MacOS?

I try to build on the Macbook Pro with M1 Pro full version, and system version is Macos Ventura 13.1 I run command by >> **python models_server.py --config config.gradio.yaml** I have encountered …

cclemon2143 updated 1 year ago
1
csukuangfj/kaldifeat #26

potential memory leak

Hi~ I use valgrind to test the below line: ``` import kaldifeat ``` The log is here: https://1drv.ms/t/s!AhtoTbISXXlSgSLtqDWjo8RP_ouA?e=Sn1cd0 Log summary: ``` ==20510== LEAK SUMMARY: =…

Slyne updated 1 year ago
4
huggingface/text-generation-inference #2388

[BUG] Running FP8 quantized model fails on NVIDIA L4 (repack…

### System Info - **Hardware**: AWS g6.12xlarge (us-east-2) / 4x NVIDIA L4 GPU - **OS**: Ubuntu 24.04 LTS (Noble Numbat) - **NVIDIA Driver**: nvidia-open 560.28.03 - **CUDA**: 12.6 - **Docker**: …

DrNochi updated 1 week ago
5
toolboc/IntelligentEdgeHOL #11

Error running YOLO "NvRmPrivGetChipPlatform: Could not read …

Hi, Firstly thanks for the awesome example - excited to get it working. I am having trouble running the YOLO on the NVidia Jetson. Below is the log output. I feel the error is this: "NvRmPrivGet…

tank104 updated 1 year ago
1
triton-inference-server/tensorrtllm_backend #367

sreaming mode doesn't work

### System Info V100*2 nvcr.io/nvidia/tritonserver:24.01-trtllm-python-py3 tensorrt-llm 0.7.0 ### Who can help? _No response_ ### Information - [X] The official example scripts - [ ] My own mo…

dongteng updated 4 months ago
2
triton-inference-server/tensorrtllm_backend #337

modelInstanceState: [json.exception.out_of_range.403] key 'b…

**Description** Trying to deploy Mistral-7B with Triton+TensorRT-LLM and running into this issue **Triton Information** Are you using the Triton container or did you build it yourself? nvcr.i…

shamikatamazon updated 5 months ago
12
TabbyML/tabby #624

Tabby VSCode Extension: Autostart Tabby Server

**Context** I use Tabby VSCode extension with a local Tabby server. Currently, when I start VSCode and the Tabby server is not running, it reminds me of that through the yellow indicated extension i…

matthiasgeihs updated 2 months ago
14

上一页 1...84 85 86 87 88 89 90...100 下一页

1000+ results for inference-server

1000+ results
for inference-server