-
- [ ] [Announcing Together Inference Engine 2.0 with new Turbo and Lite endpoints](https://www.together.ai/blog/together-inference-engine-2)
-
/kind feature
**Describe the solution you'd like**
Please add [https://github.com/xorbitsai/inference](https://github.com/xorbitsai/inference) as a KServe Hugging Face LLM serving runtime.
Xor…
-
**What would you like to be added/modified**:
Sedna is an edge-cloud synergy AI project incubated in KubeEdge SIG AI. Benefiting from the edge-cloud synergy capabilities provided by KubeEdge, Sed…
-
### Is this a new feature, an improvement, or a change to existing functionality?
New Feature
### How would you describe the priority of this feature request
Critical (currently preventing usage)
…
-
OpenAI now exposes usage stats for the streaming completion APIs:
https://community.openai.com/t/usage-stats-now-available-when-using-streaming-with-the-chat-completions-api-or-completions-api/738156…
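A minimal sketch of how the streamed usage stats could be consumed. It assumes the shape used by the `openai` Python SDK: when `stream_options={"include_usage": True}` is passed, the final chunk of a streamed chat completion carries a `usage` object and earlier chunks have `usage=None`. The helper relies only on that shape.

```python
def usage_from_stream(stream):
    """Return the usage object from the last chunk that carries one, or None."""
    usage = None
    for chunk in stream:
        # Earlier chunks carry usage=None; only the final chunk has real usage.
        chunk_usage = getattr(chunk, "usage", None)
        if chunk_usage is not None:
            usage = chunk_usage
    return usage
```

With the real client, the stream would come from `client.chat.completions.create(..., stream=True, stream_options={"include_usage": True})` and be passed to this helper.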
-
**Describe the bug**
After updating to 0.3.21, I'm getting:
2024-07-27 13:34:07,646 - MemGPT.memgpt.server.server - DEBUG - Starting agent step
/MemGPT/memgpt/data_types.py:92: UserWarning: Failed to…
-
## ❓ General Questions
I have verified that TVM is installed on my arm64 device, and I want to run mlc_llm on it for model inference. But when I installed mlc_llm on my device li…
-
Hi,
Can I deploy my own custom embedding model with LitServe? Is there any documentation on this?
-
If you're encountering an error while pulling the `latest` tag of the `huggingface/text-generation-inference` Docker image, follow these steps to resolve it:
#### Steps to Fix
1. **Find the Spec…
-
### The Feature
Ensure that we can access our fine-tuned Gemini via the Google AI Studio adapter. I haven't tested it yet.
### Motivation, pitch
You can fine-tune Google Gemini Pro 1.0 with yo…