unum-cloud / uform

Pocket-Sized Multimodal AI for content understanding and generation across multilingual texts, images, and 🔜 video, up to 5x faster than OpenAI CLIP and LLaVA 🖼️ & 🖋️
https://unum-cloud.github.io/uform/
Apache License 2.0
983 stars 56 forks source link

New Models 🥳 #75

Closed ashvardanian closed 4 months ago

ashvardanian commented 4 months ago

Today we are releasing a new batch of multimodal models trained with Nebius and already available on HuggingFace 🤗

  1. Matryoshka style multimodal embeddings ranging from 64 to 256 and 768 dimensions 🖼️
  2. Improved multimodal chat in 1.2B parameters, tuned with Direct Preference Optimization 💬
  3. ONNX backend, making PyTorch dependency optional for lightning fast deployments ⚡
ashvardanian commented 4 months ago

:tada: This PR is included in version 2.0.0 :tada:

The release is available on GitHub release

Your semantic-release bot :package::rocket: