aws-samples / foundation-model-benchmarking-tool

Foundation model benchmarking tool. Run any model on any AWS platform and benchmark for performance across instance type and serving stack options.
MIT No Attribution
151 stars 21 forks source link

Bump litellm from 1.34.0 to 1.40.0 in /src/fmbench #112

Open dependabot[bot] opened 4 weeks ago

dependabot[bot] commented 4 weeks ago

Bumps litellm from 1.34.0 to 1.40.0.

Release notes

Sourced from litellm's releases.

v1.40.0

What's Changed

Full Changelog: https://github.com/BerriAI/litellm/compare/v1.39.6...v1.40.0

Docker Run LiteLLM Proxy

docker run \
-e STORE_MODEL_IN_DB=True \
-p 4000:4000 \
ghcr.io/berriai/litellm:main-v1.40.0

Don't want to maintain your internal proxy? get in touch 🎉

Hosted Proxy Alpha: https://calendly.com/d/4mp-gd3-k5k/litellm-1-1-onboarding-chat

Load Test LiteLLM Proxy Results

Name Status Median Response Time (ms) Average Response Time (ms) Requests/s Failures/s Request Count Failure Count Min Response Time (ms) Max Response Time (ms)
/chat/completions Passed ✅ 120.0 133.63252197830545 6.467733658247951 0.0 1936 0 94.77090299998281 801.180971000008
Aggregated Passed ✅ 120.0 133.63252197830545 6.467733658247951 0.0 1936 0 94.77090299998281 801.180971000008

v1.39.6

We're launching team member invites (No SSO Required) on v1.39.6 🔥 Invite team member to view LLM Usage, Spend per service https://docs.litellm.ai/docs/proxy/ui

👍 [Fix] Cache Vertex AI clients - Major Perf improvement for VertexAI models

✨ Feat - Send new users invite emails on creation (using 'send_invite_email' on /user/new)

💻 UI - allow users to sign in with with email/password

🔓 [UI] Admin UI Invite Links for non SSO

✨ PR - [FEAT] Perf improvements - litellm.completion / litellm.acompletion - Cache OpenAI client inviting_members_ui

... (truncated)

Commits
  • 93c9ea1 fix(openai.py): fix client caching logic
  • 63fb3a9 Merge pull request #3961 from BerriAI/litellm_docker_compose_start
  • ce4ba80 build(docker-compose.yml): load local .env in docker compose quick start
  • 2245ee1 test(test_scheduler.py): fix testing
  • 9b4a19b build(docker-compose.yml): startup docker compose with postgres
  • 7715267 fix(router.py): simplify scheduler
  • 27087f6 Merge pull request #3959 from BerriAI/litellm_support_verify_ssl_false
  • d7160eb fix(test_scheduler.py): fix test
  • a16a1c4 fix(http_handler.py): allow setting ca bundle path
  • f75c15d fix(proxy_server.py): security fix - fix sql injection attack on global spend...
  • Additional commits viewable in compare view


Dependabot compatibility score

Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.


Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot show ignore conditions` will show all of the ignore conditions of the specified dependency - `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself) - `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself) You can disable automated security fix PRs for this repo from the [Security Alerts page](https://github.com/aws-samples/foundation-model-benchmarking-tool/network/alerts).