BBC-Esq closed this 2 months ago
https://huggingface.co/nvidia/Llama-3.1-Minitron-4B-Width-Base
https://huggingface.co/nvidia/Llama-3.1-Minitron-4B-Depth-Base
https://huggingface.co/nvidia/Mistral-NeMo-Minitron-8B-Base
Not currently viable - tensor shape mismatch. Will re-create when models are corrected.
Model Comparison: Llama-3.1 Minitron and Mistral-NeMo Minitron
Performance Metrics
Common Features
Key Differences