-
### What happened?
When we call Gemini models from Vertex AI, no usage entry is recorded in the transaction collection.
### Steps to Reproduce
1. Deploy Gemini Pro and/or Gemini Pro Vision in Vertex AI.
2…
-
At present, if you deploy something that ends up in `CrashLoopBackOff`, FTL will wait forever. We need to be able to handle failed deployments without hanging.
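The detection logic amounts to polling pod status with a deadline instead of blocking indefinitely. A minimal sketch of that check using the Python `kubernetes` client (the function name, label selector, and timeout are illustrative, not FTL's actual API):

```python
import time

from kubernetes import client, config


def wait_for_rollout(namespace: str, label_selector: str, timeout_s: int = 300) -> None:
    """Wait for pods to become ready, but fail fast on CrashLoopBackOff."""
    config.load_kube_config()
    v1 = client.CoreV1Api()
    deadline = time.monotonic() + timeout_s
    while time.monotonic() < deadline:
        pods = v1.list_namespaced_pod(namespace, label_selector=label_selector)
        for pod in pods.items:
            for cs in pod.status.container_statuses or []:
                waiting = cs.state.waiting
                if waiting and waiting.reason == "CrashLoopBackOff":
                    # Surface the failure instead of hanging forever.
                    raise RuntimeError(f"pod {pod.metadata.name} is in CrashLoopBackOff")
        ready = pods.items and all(
            p.status.container_statuses
            and all(cs.ready for cs in p.status.container_statuses)
            for p in pods.items
        )
        if ready:
            return  # every container in every pod reported ready
        time.sleep(5)
    raise TimeoutError(f"deployment not ready after {timeout_s}s")
```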
-
[Bug]: Device with "gpu" name is not registered in the OpenVINO Runtime
```
Traceback (most recent call last):
  File "/data/scratch/mkw-anomalib/anomalib-predict.py", line 27, in <module>
    inferencer = O…
```
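For what it's worth, this error usually means the runtime cannot see a device under that exact name: OpenVINO device names are case-sensitive upper-case strings ("GPU", not "gpu"), and the GPU plugin and drivers must be installed. A quick way to check what the runtime actually registers (the model path below is a placeholder):

```python
from openvino.runtime import Core

core = Core()
# Lists whatever devices the runtime registered, e.g. ['CPU', 'GPU'].
print(core.available_devices)

# Device names are case-sensitive: "GPU" works only if it appears above;
# passing "gpu" produces the "not registered" error from the title.
model = core.read_model("model.xml")  # placeholder path
compiled = core.compile_model(model, device_name="GPU")
```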
-
### Question
Hello. I apologize for asking a really fundamental question; I don't have anywhere else to ask questions like this.
I'm practicing the SageMaker deploy method with Hugging Face, and it say…
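Since the question is cut off, here is only the generic Hub-based deploy flow from the `sagemaker` SDK in case it helps; the model id, DLC versions, and instance type below are placeholders to adapt:

```python
import sagemaker
from sagemaker.huggingface import HuggingFaceModel

role = sagemaker.get_execution_role()  # IAM role with SageMaker permissions

# Pull the model straight from the Hugging Face Hub at endpoint start-up.
hub = {
    "HF_MODEL_ID": "distilbert-base-uncased-finetuned-sst-2-english",  # placeholder
    "HF_TASK": "text-classification",
}

model = HuggingFaceModel(
    env=hub,
    role=role,
    transformers_version="4.26",  # must match an existing Hugging Face DLC combination
    pytorch_version="1.13",
    py_version="py39",
)

predictor = model.deploy(initial_instance_count=1, instance_type="ml.m5.xlarge")
print(predictor.predict({"inputs": "I love using SageMaker!"}))

predictor.delete_endpoint()  # endpoints bill while they sit idle
```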
-
### What is the issue?
I fine-tuned a sqlcoder model and generated a model file. When I deployed it on Ollama, there was a problem: the model could not run, and the size of the Ollama file was incorre…
-
/kind feature
**Describe the solution you'd like**
I hope you can add [https://github.com/xorbitsai/inference](https://github.com/xorbitsai/inference) as a KServe Hugging Face LLM serving runtime.
Xor…
-
### Environment information
```plain text
System:
OS: macOS 11.7.10
CPU: (8) x64 Intel(R) Core(TM) i7-4980HQ CPU @ 2.80GHz
Memory: 1.14 GB / 16.00 GB
Shell: /bin/zsh
Binaries:
Node: 2…
```
-
### Self Checks
- [X] I have searched for [existing issues](https://github.com/langgenius/dify/issues), including closed ones.
- [X] I confirm that I am using English to su…
-
Given that your predict module provides convert_to_onnx functionality and I need to use C++ for inference deployment, how do I do that?
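The usual split (sketched below with a stand-in `torch.nn.Linear`, since the actual predict module isn't shown) is to export to ONNX once from Python, sanity-check the file with `onnxruntime`, and then load the very same `.onnx` file from the ONNX Runtime C++ API (`Ort::Env` / `Ort::Session`) in the deployment binary:

```python
import torch
import onnxruntime as ort

# Stand-in model; replace with whatever the predict module wraps.
model = torch.nn.Linear(4, 2).eval()
dummy = torch.randn(1, 4)

# Export once from Python.
torch.onnx.export(
    model,
    dummy,
    "model.onnx",
    input_names=["input"],
    output_names=["output"],
    dynamic_axes={"input": {0: "batch"}},  # allow variable batch size
)

# Verify the exported graph before moving to C++; the same model.onnx
# loads unchanged from the ONNX Runtime C++ API, so nothing is re-exported.
session = ort.InferenceSession("model.onnx")
(out,) = session.run(None, {"input": dummy.numpy()})
print(out.shape)  # (1, 2)
```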
-
I am running the llama3 model on an RTX 4090 with fp8 quantization. In the [result](https://github.com/NVIDIA/TensorRT-LLM/blob/main/cpp/include/tensorrt_llm/executor/executor.h#L323), `outputTokenIds` see…