Granite 3 MoE: The IBM Granite 1B and 3B models are the first mixture of experts (MoE) Granite models from IBM, designed for low-latency usage.
Granite 3 Dense: The IBM Granite 2B and 8B models are designed for tool-based use cases and retrieval augmented generation (RAG), streamlining code generation, translation and bug fixing.
Thank you @gabe-l-hart for contributing Granite support to Ollama!
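As a rough illustration of trying one of the new models, the sketch below builds a non-streaming request for a local Ollama server's `/api/generate` endpoint using only the Go standard library. The model tag `granite3-dense:8b`, the address `http://localhost:11434`, and the helper names `buildGenerateBody` and `newGenerateRequest` are assumptions made for this example and do not come from the release notes.

```go
package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"net/http"
	"os"
)

// generateRequest holds the subset of /api/generate fields this sketch sets.
type generateRequest struct {
	Model  string `json:"model"`
	Prompt string `json:"prompt"`
	Stream bool   `json:"stream"`
}

// buildGenerateBody (hypothetical helper) encodes a single non-streaming prompt.
func buildGenerateBody(model, prompt string) ([]byte, error) {
	return json.Marshal(generateRequest{Model: model, Prompt: prompt, Stream: false})
}

// newGenerateRequest (hypothetical helper) prepares the POST request without sending it.
func newGenerateRequest(baseURL, model, prompt string) (*http.Request, error) {
	body, err := buildGenerateBody(model, prompt)
	if err != nil {
		return nil, err
	}
	req, err := http.NewRequest(http.MethodPost, baseURL+"/api/generate", bytes.NewReader(body))
	if err != nil {
		return nil, err
	}
	req.Header.Set("Content-Type", "application/json")
	return req, nil
}

func main() {
	// Assumed values: the default local address and the granite3-dense:8b tag.
	req, err := newGenerateRequest("http://localhost:11434", "granite3-dense:8b",
		"Find the bug in this loop: for i := 0; i <= len(s); i++ { _ = s[i] }")
	if err != nil {
		fmt.Fprintln(os.Stderr, "building request failed:", err)
		os.Exit(1)
	}
	fmt.Println(req.Method, req.URL)
}
```

Passing the prepared request to `http.DefaultClient.Do` would send it to a running server. The example stops short of that so it runs without one.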
What's Changed
Fix crashes for AMD GPUs with small system memory
Fix error that would occur on macOS 11 Big Sur
Fix issue when creating models from bf16 file types
Improve CPU performance with better default thread counts
Llama Guard 3: a series of models by Meta, fine-tuned for content safety classification of LLM inputs and responses.
ShieldGemma: a set of instruction-tuned models from Google DeepMind for evaluating the safety of text prompt inputs and text output responses against a set of defined safety policies.
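A safety classifier's reply usually has to be turned into a decision by the calling code. The sketch below parses a Llama Guard style verdict, assuming the reply is either `safe` or `unsafe` followed by a second line of comma-separated category codes such as `S1`. That reply format, the `verdict` type, and the `parseVerdict` helper are assumptions for this example. They are not stated in these release notes, so check the model card before relying on them.

```go
package main

import (
	"fmt"
	"strings"
)

// verdict is a hypothetical container for a parsed classifier reply.
type verdict struct {
	Safe       bool
	Categories []string // violated category codes; empty when Safe is true
}

// parseVerdict reads the assumed two-line reply format:
// line 1 is "safe" or "unsafe"; line 2 (optional) lists category codes.
func parseVerdict(raw string) (verdict, error) {
	lines := strings.Split(strings.TrimSpace(raw), "\n")
	switch strings.ToLower(strings.TrimSpace(lines[0])) {
	case "safe":
		return verdict{Safe: true}, nil
	case "unsafe":
		v := verdict{Safe: false}
		if len(lines) > 1 {
			for _, code := range strings.Split(lines[1], ",") {
				if code = strings.TrimSpace(code); code != "" {
					v.Categories = append(v.Categories, code)
				}
			}
		}
		return v, nil
	default:
		// Treat anything else as an error rather than guessing a verdict.
		return verdict{}, fmt.Errorf("unrecognized verdict %q", lines[0])
	}
}

func main() {
	v, err := parseVerdict("unsafe\nS1,S10")
	if err != nil {
		fmt.Println("parse error:", err)
		return
	}
	fmt.Printf("%+v\n", v)
}
```

Returning an error for unexpected text, instead of defaulting to safe, keeps a malformed reply from being silently treated as a pass.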
What's Changed
Default model in quickstart is now llama3.2
Fixed issue where `ollama pull` would leave connections open when encountering an error
`ollama rm` will now stop a running model before deleting it
New Go runner (for contributors):
Ollama is transitioning to a new Go-based subprocess model runner with improved reliability and caching. The new runner, found under the llama directory, replaces the previous code under llm/ext_server. During this transition period, the new Go runner is opt-in at build time and requires a different build approach. For more information, see development.md.
Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.
Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR:
- `@dependabot rebase` will rebase this PR
- `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it
- `@dependabot merge` will merge this PR after your CI passes on it
- `@dependabot squash and merge` will squash and merge this PR after your CI passes on it
- `@dependabot cancel merge` will cancel a previously requested merge and block automerging
- `@dependabot reopen` will reopen this PR if it is closed
- `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
- `@dependabot show ignore conditions` will show all of the ignore conditions of the specified dependency
- `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
- `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
- `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
Bumps github.com/ollama/ollama from 0.3.12 to 0.3.14.
Release notes
Sourced from github.com/ollama/ollama's releases.
Commits
- `f2890a4` IBM granite/granitemoe architecture support (#6760)
- `05cd82e` Rename gpu package discover (#7143)
- `7d6eb0d` Move macos v11 support flags to build script (#7203)
- `24636df` Discovery CPU details for default thread selection (#6264)
- `1d7fa3a` Adding 'Ollama App' as community integrations (#6465)
- `09035b7` Add missing BF16 tensor type. (#7193)
- `f3c8b89` Track GPU discovery failure information (#5820)
- `5dd0477` Fix regression on older macos versions (#7192)
- `c3d321d` llm: Remove GGML_CUDA_NO_PEER_COPY for ROCm (#7174)
- `7fe3902` cli: Send all images in conversation history