lobehub / lobe-chat

🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Azure / DeepSeek), Knowledge Base (file upload / knowledge management / RAG ), Multi-Modals (Vision/TTS/Plugins/Artifacts). One-click FREE deployment of your private ChatGPT/ Claude application.
https://chat-preview.lobehub.com

[Request] aws bedrock cross-region inference compatibility #4659

Open gemnioo opened 3 weeks ago

gemnioo commented 3 weeks ago

🥰 Feature description

AWS Bedrock is rolling out cross-region inference for Claude 3.5 and some Llama models. This causes compatibility issues with the current lobe.env.

🧐 Proposed solution

With a designated region such as "us-west-2", Claude 3.5 Sonnet v2 is addressed as "anthropic.claude-3-5-sonnet-20241022-v2:0",

or, with cross-region inference, as "us.anthropic.claude-3-5-sonnet-20241022-v2:0".

The difference in the inference profile ID (aka the 'model identifier') leads to:

"Invocation of model ID anthropic.claude-3-5-sonnet-20241022-v2:0 with on-demand throughput isn’t supported" error in lobechat

"The provided model identifier is invalid" error in liteLLM Proxy
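
To make the ID difference concrete, here is a minimal sketch of how a client might convert a plain model ID into its cross-region form. The helper names (`isInferenceProfileId`, `toInferenceProfileId`) are hypothetical and are not part of lobe-chat, LiteLLM, or the AWS SDK. Only the "us." prefix shown above is assumed.

```typescript
// Hypothetical helpers for illustration only; not lobe-chat or AWS SDK APIs.

// Geography prefixes seen on cross-region inference profile IDs.
// Only "us" appears in this issue; any other prefix would be an assumption.
const GEO_PREFIXES = ["us"] as const;
type GeoPrefix = (typeof GEO_PREFIXES)[number];

// True when the ID already carries a geography prefix such as "us.".
function isInferenceProfileId(modelId: string): boolean {
  return GEO_PREFIXES.some((prefix) => modelId.startsWith(`${prefix}.`));
}

// Prepend the geography prefix unless the ID already has one.
function toInferenceProfileId(modelId: string, geo: GeoPrefix = "us"): string {
  return isInferenceProfileId(modelId) ? modelId : `${geo}.${modelId}`;
}

console.log(toInferenceProfileId("anthropic.claude-3-5-sonnet-20241022-v2:0"));
// prints "us.anthropic.claude-3-5-sonnet-20241022-v2:0"
```

Calling the helper on an ID that already has the prefix returns it unchanged, so it is safe to apply to user-supplied model lists.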

📝 Additional information

AWS cross-region inference pages:

https://docs.aws.amazon.com/bedrock/latest/userguide/inference-profiles-support.html

https://us-east-1.console.aws.amazon.com/bedrock/home?region=us-east-1#/inference-profiles

lobehubbot commented 3 weeks ago

👀 @gemnioo

Thank you for raising an issue. We will look into the matter and get back to you as soon as possible. Please make sure you have given us as much context as possible.

gemnioo commented 3 weeks ago

reference:

https://github.com/lobehub/lobe-chat/issues/4458

https://github.com/BerriAI/litellm/issues/6589

gemnioo commented 3 weeks ago

IMO lobechat and other clients need to find a way to be compatible with cross-region inference, rather than adding more model entries.

region us-west-2 with "anthropic.claude-3-5-sonnet-20241022-v2:0" ✅

region us-west-2/us-east-1/us-east-2 with "us.anthropic.claude-3-5-sonnet-20241022-v2:0" ✅

region us-east-1/us-east-2 with "anthropic.claude-3-5-sonnet-20241022-v2:0" ❌

The same applies to Claude's latest update, the 3.5 Haiku model.

For the older Claude 3 Haiku, Sonnet, and Opus models, cross-region inference is available only in us-east-1 / us-west-2.
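
One way to avoid adding duplicate model entries is to keep a single model entry and choose the identifier per region. The sketch below encodes only the ✅/❌ results reported above for Claude 3.5 Sonnet v2. The table names and `pickModelId` are hypothetical, and a real implementation would need data for every model and region.

```typescript
// Hypothetical lookup tables built from the results reported in this comment.
// Regions where the plain (on-demand) model ID worked.
const ON_DEMAND_REGIONS: Record<string, string[]> = {
  "anthropic.claude-3-5-sonnet-20241022-v2:0": ["us-west-2"],
};

// Regions where the cross-region inference profile ID worked.
const PROFILE_REGIONS: Record<string, string[]> = {
  "us.anthropic.claude-3-5-sonnet-20241022-v2:0": ["us-west-2", "us-east-1", "us-east-2"],
};

// Prefer the plain ID where it is known to work, fall back to the "us."
// inference profile ID, and return undefined when neither is known to work.
function pickModelId(baseId: string, region: string): string | undefined {
  if (ON_DEMAND_REGIONS[baseId]?.includes(region)) return baseId;
  const profileId = `us.${baseId}`; // assumes the "us." prefix from this thread
  if (PROFILE_REGIONS[profileId]?.includes(region)) return profileId;
  return undefined;
}

const base = "anthropic.claude-3-5-sonnet-20241022-v2:0";
console.log(pickModelId(base, "us-west-2")); // plain ID
console.log(pickModelId(base, "us-east-1")); // "us."-prefixed profile ID
```

Returning `undefined` for unknown combinations lets the caller surface a clear configuration error instead of forwarding an ID that Bedrock will reject.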

[Screenshot: Amazon Bedrock]