irthomasthomas / undecidability

6 stars 2 forks source link

Mistral NeMo | Mistral AI | Frontier AI in your hands #851

Open ShellLM opened 1 month ago

ShellLM commented 1 month ago

Mistral NeMo | Mistral AI | Frontier AI in your hands

"Today, we are excited to release Mistral NeMo, a 12B model built in collaboration with NVIDIA. Mistral NeMo offers a large context window of up to 128k tokens. Its reasoning, world knowledge, and coding accuracy are state-of-the-art in its size category. As it relies on standard architecture, Mistral NeMo is easy to use and a drop-in replacement in any system using Mistral 7B.

We have released pre-trained base and instruction-tuned checkpoints checkpoints under the Apache 2.0 license to promote adoption for researchers and enterprises. Mistral NeMo was trained with quantisation awareness, enabling FP8 inference without any performance loss.

The following table compares the accuracy of the Mistral NeMo base model with two recent open-source pre-trained models, Gemma 2 9B, and Llama 3 8B."

Suggested labels

{'label-name': 'Large AI Model', 'label-description': 'Refers to state-of-the-art large AI models like Mistral NeMo with up to 128k tokens context window.', 'gh-repo': 'AI-Chatbots', 'confidence': 63.31}

ShellLM commented 1 month ago

Related content

460 similarity score: 0.89

311 similarity score: 0.87

389 similarity score: 0.86

431 similarity score: 0.86

628 similarity score: 0.86

647 similarity score: 0.86