-
Hi,
I am trying to prune Mistral 7B (https://huggingface.co/mistralai/Mistral-7B-Instruct-v0.2) and while I was able to successfully run the commands for magnitude pruning, I was facing issues with…
-
Mistral AI just dropped Pixtral, their 12b model with vision support.
- https://github.com/mistralai/mistral-common/releases/tag/v1.4.0
- https://www.reddit.com/r/LocalLLaMA/comments/1fe3x1z/mistr…
-
Thank you for developing this!
## Context
Due to lenghty computation time and in order to speed things up, I thought about using the `flash_attention_2` and smaller floating points `torch.float16`…
-
### Jan version
0.5.8
### Describe the Bug
https://discord.com/channels/1107178041848909847/1306758623325851689
> GPU: AMD Radeon RX 6800 XT
> Driver: AMD proprietary driver (version 2.0.31…
-
While trying to use tools with Mistral, here is a sample code for boto3 which works fine:
```python
import json
import boto3
session = boto3.session.Session()
bedrock_client = session.client(…
-
### System Info
请问7B的推理模型结合7B的PRM训练需要多少显存?在测试中发现80G会报显存溢出?是否能在多卡上训练呢?
### Who can help?
@ziyuwan
### Information
- [X] The official example scripts
- [ ] My own modified scripts
### Tasks
- [X…
-
### Self Checks
- [X] This is only for bug report, if you would like to ask a question, please head to [Discussions](https://github.com/langgenius/dify/discussions/categories/general).
- [X] I have s…
-
### Check for existing issues
- [X] Completed
### Describe the feature
Support codestral from MistralAI as an equivalent of OpenAI.
Codestral support infill and VsCode plugins are already availa…
-
- [ ] [Mistral NeMo | Mistral AI | Frontier AI in your hands](https://mistral.ai/news/mistral-nemo/)
# Mistral NeMo | Mistral AI | Frontier AI in your hands
## "Today, we are excited to release Mist…
-
They use the same "MistralForCausalLM" structure and seem to share some parameters such as intermediate_size, and I was wondering if it would be possible to merge them together.