[Model Request] upstage/SOLAR-10.7B-v1.0

SOLAR-10.7B, is a compact, yet remarkably powerful large language model; it has demonstrated unparalleled state-of-the-art performance in models under 30B parameters - rivaling model's with up to 30B parameters in performance.

SOLAR-10.7B, was developed using Upstage's Depth Up-Scaling. And, it was Built on the Llama2 architecture with integrated Mistral 7B weights integrated into its upscaled layers as part of its pre-training.

Upstage's Depth-Upscaled SOLAR-10.7B has remarkable performance. It outperforms models with up to 30B parameters, even surpassing the recent Mixtral 8X7B model. For detailed information, please refer to the experimental table. Solar 10.7B is an ideal choice for fine-tuning. SOLAR-10.7B offers robustness and adaptability for your fine-tuning needs. Our simple instruction fine-tuning using the SOLAR-10.7B pre-trained model yields significant performance improvements (SOLAR-10.7B-Instruct-v1.0).

https://huggingface.co/upstage/SOLAR-10.7B-v1.0

OpenGVLab / OmniQuant

[Model Request] upstage/SOLAR-10.7B-v1.0 #45

SOLAR-10.7B, is a compact, yet remarkably powerful large language model; it has demonstrated unparalleled state-of-the-art performance in models under 30B parameters - rivaling model's with up to 30B parameters in performance.