Granite 3 MoE: The IBM Granite 1B and 3B models are the first mixture of experts (MoE) Granite models from IBM designed for low latency usage.
Granite 3 Dense: The IBM Granite 2B and 8B models are designed to support tool-based use cases and support for retrieval augmented generation (RAG), streamlining code generation, translation and bug fixing.
Thank you @gabe-l-hart for contributing Granite support to Ollama!
What's Changed
Fix crashes for AMD GPUs with small system memory
Fix error that would occur on macOS 11 Big Sur
Fixed issue creating models from bf16 file types
Improve CPU performance by improving default thread counts
Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting @dependabot rebase.
Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR:
- `@dependabot rebase` will rebase this PR
- `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it
- `@dependabot merge` will merge this PR after your CI passes on it
- `@dependabot squash and merge` will squash and merge this PR after your CI passes on it
- `@dependabot cancel merge` will cancel a previously requested merge and block automerging
- `@dependabot reopen` will reopen this PR if it is closed
- `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually
- `@dependabot show ignore conditions` will show all of the ignore conditions of the specified dependency
- `@dependabot ignore this major version` will close this PR and stop Dependabot creating any more for this major version (unless you reopen the PR or upgrade to it yourself)
- `@dependabot ignore this minor version` will close this PR and stop Dependabot creating any more for this minor version (unless you reopen the PR or upgrade to it yourself)
- `@dependabot ignore this dependency` will close this PR and stop Dependabot creating any more for this dependency (unless you reopen the PR or upgrade to it yourself)
Bumps github.com/ollama/ollama from 0.3.12 to 0.4.0.
Release notes
Sourced from github.com/ollama/ollama's releases.
... (truncated)
Commits
9d71bcc
Update README.md (#7516)a4c70fe
One corrupt manifest should not wedge model operations (#7515)34a7510
prompt: Use a single token when estimating mllama context size4157d1f
readme: add Hexabot to the list of community integrations4ebfa2c
Quiet down debug log of image payload (#7454)046054f
CI: Switch to v13 macos runner (#7498)95483f3
CI: matrix strategy fix (#7496)f247a62
Merge pull request #7456 from ollama/mxyng/llama3.2-vision-mem44bd9e5
Sign windows arm64 official binaries (#7493)18237be
readme: add TextCraft to community integrations (#7377)Dependabot will resolve any conflicts with this PR as long as you don't alter it yourself. You can also trigger a rebase manually by commenting
@dependabot rebase
.Dependabot commands and options
You can trigger Dependabot actions by commenting on this PR: - `@dependabot rebase` will rebase this PR - `@dependabot recreate` will recreate this PR, overwriting any edits that have been made to it - `@dependabot merge` will merge this PR after your CI passes on it - `@dependabot squash and merge` will squash and merge this PR after your CI passes on it - `@dependabot cancel merge` will cancel a previously requested merge and block automerging - `@dependabot reopen` will reopen this PR if it is closed - `@dependabot close` will close this PR and stop Dependabot recreating it. You can achieve the same result by closing it manually - `@dependabot show