model-deployment Search Results

hustvl/Senna #6

Model Deployment

I am also interested in your impressive work! Since you are from Horizon Robotics, will you consider deploying the model on the Journey 6 platform in the future?

chensiweiTHU updated 1 day ago

big-data-01-org/backend #5

ML Model deployment

v-nemeth updated 3 weeks ago

ethereum-optimism/design-docs #158

Multi-deployment OPCM model

**Title of Meeting** Multi-deployment OPCM model **Date, Time and Duration** Thursday, November 7 · 3:00 – 4:00pm Time zone: America/New_York/EST **Link to design doc** https://githu…

blmalone updated 3 weeks ago

vllm-project/vllm #10664

[Performance]: There is a 10x performance gap between the lo…

### Proposal to improve performance vllm serve /workspace/model/llm/Qwen/Qwen2_5-3B-Instruct\ --host 0.0.0.0 \ --port 2017 \ --tensor-parallel-size 1 \ --gpu-memory-utilization …

LIUKAI0815 updated 3 days ago

cohere-ai/cohere-toolkit #830

Generating Citations with a Custom Model Deployment

### What is the issue? I am unable to generate citations with a custom model deployment by simply yielding a `CITATION_GENERATION` event, nor by including citations in the `STREAM_END` event in the m…

bcicc updated 3 days ago

camunda/camunda-modeler #4607

Desktop Modeler deployment via REST API

### Problem you would like to solve Setting up gRPC/HTTP2 in company networks through firewalls, load balancers and such sometimes causes problems for customers. ### Proposed solution Use t…

xevien96 updated 1 week ago

getcursor/cursor #1922

Azure OpenAI - Multiple Deployment/Model Support

While initial support was added for Azure OpenAI, there are outstanding issues with the current implementation. **Currently only one deployment/model is supported.** * If I want to use gpt-4o via …

illgitthat updated 1 day ago

microsoft/DeepSpeed-MII #525

multi model deployment

Hello, can I deploy several models on one server at once?

whcjb updated 2 months ago

NVIDIA/TensorRT-LLM #1600

Deployment of Pruned Models

Hi there, I just want to ask: for a pruned model, how can we deploy it using TensorRT-LLM? Since the qkv dimensions in each layer are different, the model is stored using torch.save rather th…

qianjyM updated 2 weeks ago

afislonge/deepfake-detection-project #49

Model deployment

JuanS286 updated 1 month ago

1000+ results for model-deployment
