-
**Describe the bug**
(Following the steps at https://docs.letta.com/server/docker#run-letta-server-with-docker)
1. Clone the repo
2. Create .env with OPENAI_API_KEY="sk-..."
3. docker compose up
4. v…
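For anyone reproducing, the steps above boil down to roughly the following (the repo URL is an assumption based on the docs; adjust if they point elsewhere):

```bash
# Assumed repo URL, taken from the Letta docs rather than this report.
git clone https://github.com/letta-ai/letta.git
cd letta
# .env at the repo root is picked up by docker compose
echo 'OPENAI_API_KEY="sk-..."' > .env
docker compose up
```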
-
### Willingness to contribute
No. I cannot contribute this feature at this time.
### Proposal Summary
I would like the MLflow AI Gateway to add support for Gemini foundation models
### Motivation
>…
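For illustration, a hypothetical route definition in the gateway's YAML config might look like the sketch below; the `gemini` provider and its config keys do not exist today and are exactly what this proposal asks for:

```yaml
# Hypothetical sketch: the "gemini" provider is the requested feature and
# does not exist in MLflow yet; key names are assumptions, not a real API.
routes:
  - name: chat-gemini
    route_type: llm/v1/chat
    model:
      provider: gemini            # proposed provider
      name: gemini-pro
      config:
        gemini_api_key: $GEMINI_API_KEY
```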
-
I have tried a variety of models that support function calling, but none of them gives the correct answer when using LM Studio. Since it exposes an API compatible with OpenAI's, it should accept the `tools` k…
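For reference, this is the kind of OpenAI-style request I would expect to work against LM Studio's local server (port 1234 is LM Studio's default; the model name and tool definition are placeholders):

```bash
# OpenAI-compatible chat completion request carrying a tools definition.
# localhost:1234 is LM Studio's default address; the model name is a placeholder.
curl http://localhost:1234/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "local-model",
    "messages": [{"role": "user", "content": "What is the weather in Paris?"}],
    "tools": [{
      "type": "function",
      "function": {
        "name": "get_weather",
        "description": "Get the current weather for a city",
        "parameters": {
          "type": "object",
          "properties": {"city": {"type": "string"}},
          "required": ["city"]
        }
      }
    }]
  }'
```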
-
### What is the issue?
When calling llava models from a REST client, setting the temperature causes the ollama server to hang until the process is killed.
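For reproduction, a minimal request of the kind that triggers the hang might look like the sketch below (model name, prompt, and image payload are placeholders; `options.temperature` is the setting in question):

```bash
# Minimal reproduction sketch: llava generate request with temperature set.
# The base64 image payload is elided; the prompt is a placeholder.
curl http://localhost:11434/api/generate -d '{
  "model": "llava",
  "prompt": "Describe this image.",
  "images": ["<base64-encoded image>"],
  "options": {"temperature": 0.7}
}'
```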
### OS
Windows
### GPU
Nvidia
### CPU
AMD
### Ol…
-
Here's a rough idea of what I have so far:
![aixen server model](https://raw.github.com/haneefmubarak/diagrams/master/aixen/network-server-model.png)
Sorry about the graph being messy and all. Anywa…
-
## Bug Description
I'm trying to serve a Torch-TensorRT optimized model with the NVIDIA Triton server, following the provided tutorial:
https://pytorch.org/TensorRT/tutorials/serving_torch_tensorrt_with_t…
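For context, the serving step is essentially launching Triton against a local model repository; a rough sketch (the image tag and repository path are assumptions, not taken from this report):

```bash
# Launch Triton against a local model repository; the image tag and the
# model_repository path are assumptions.
docker run --gpus all --rm \
  -p 8000:8000 -p 8001:8001 -p 8002:8002 \
  -v "$(pwd)/model_repository":/models \
  nvcr.io/nvidia/tritonserver:24.08-py3 \
  tritonserver --model-repository=/models
```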
-
In `reverie/backend_server/persona/prompt_template/run_gpt_prompt.py`, multiple requests to OpenAI are made with a hardcoded model `gpt-35-turbo-0125`, which is currently not a valid/supported model o…
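As a quick local workaround (assuming a currently supported name such as `gpt-3.5-turbo-0125` is an acceptable substitute), the hardcoded model could be swapped in place:

```bash
# Hypothetical workaround: replace the hardcoded model name throughout the file.
# gpt-3.5-turbo-0125 is an assumption; substitute any model your account supports.
sed -i 's/gpt-35-turbo-0125/gpt-3.5-turbo-0125/g' \
  reverie/backend_server/persona/prompt_template/run_gpt_prompt.py
```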
-
Here are the container parameters:
export DOCKER_IMAGE=intelanalytics/ipex-llm-inference-cpp-xpu:latest
export CONTAINER_NAME=ipex-llm-inference-cpp-xpu-container
podman run -itd \
…
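For reference, a typical launch of this image looks roughly like the sketch below; the flags are assumptions based on the usual XPU container setup, not the exact (truncated) command from this report:

```bash
# Sketch of a typical ipex-llm XPU container launch; flags are assumptions.
# --device=/dev/dri exposes the Intel GPU to the container.
podman run -itd \
    --net=host \
    --device=/dev/dri \
    --memory="32G" \
    --shm-size="16g" \
    --name=$CONTAINER_NAME \
    $DOCKER_IMAGE
```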
-
It would be nice if the model service page let the user access the server log, which can be helpful for debugging.
-
Envoy supports sending the full request body to the external authorization server via the with_request_body filter configuration. Do you think it would be possible to expose such a feature on the Securit…
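For context, this is roughly the Envoy-level knob in question, as an excerpt of the ext_authz filter config (the byte limit and cluster name are illustrative values):

```yaml
# Excerpt of Envoy's ext_authz filter showing with_request_body; the byte
# limit and cluster name are illustrative, not taken from this report.
http_filters:
- name: envoy.filters.http.ext_authz
  typed_config:
    "@type": type.googleapis.com/envoy.extensions.filters.http.ext_authz.v3.ExtAuthz
    with_request_body:
      max_request_bytes: 8192
      allow_partial_message: true
    grpc_service:
      envoy_grpc:
        cluster_name: ext_authz
```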