mistralai Search Results

1000+ results
for mistralai

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

predibase/lorax #595

Flash Attention is not installed?

``` raise NotImplementedError("flash attention is not installed") NotImplementedError: flash attention is not installed 2024-09-06T05:28:11.268655Z ERROR shard-manager: lorax_launch…

ObliviousDonkey updated 1 week ago
8
InternLM/xtuner #533

Mixtral 8x7B SFT 问题

您好，正在尝试微调mixtral 8x7b，但是训练一段时间后loss不再下降，输出也有些问题使用的config如下： ```python # Copyright (c) OpenMMLab. All rights reserved. import torch from datasets import load_dataset from mmengine.dataset im…

aiyinyuedejustin updated 6 months ago
4
jessevig/bertviz #128

Any plan on upadating the code for LLaMA models?

Thank you for the great repo. Is there any plan from your side to update the code for LLaMA model? or is there anything I can do to update the codes to visualize LLaMA model?

iBibek updated 2 months ago
11
lm-sys/FastChat #3055

Using train_with_template on mistral end up in a model with …

I use `train_with_template.py` with `mistralai/Mistral-7B-Instruct-v0.2` ``` torchrun --nproc_per_node=2 --master_port=20001 fastchat/train/train_with_template.py \ --model_name_or_path mistr…

christobill updated 7 months ago
3
langchain-ai/langchainjs #4526

ChatMistralAI has error: Failed to execute 'decode' on 'Text…

Trying to use this ChatMistralAI for streaming from this instruction https://js.langchain.com/docs/integrations/chat/mistral but got error ``` TypeError: Failed to execute 'decode' on 'TextDecoder…

logancyang updated 2 weeks ago
11
mistralai/client-python #90

Client exception when working with pytorch.

```python from dotenv import load_dotenv import torch from mistralai.client import MistralClient print(torch.tensor([1, 2, 3])) load_dotenv() mistral = MistralClient() print("hello world"…

saurabhmahra91 updated 1 month ago
1
pratyushasharma/laser #4

Mistral Support

Hi, Great work on this! Is Mistral supported? Right now I only see GPT-J and Llama 2. Thank you!

fakerybakery updated 8 months ago
16
microsoft/semantic-kernel #7565

Python: New Feature: Anthropic AI connector for Python

Claude is a strong language model that many users would like to use for their application. If the SK team thinks this something that this is be valuable for semantic-kernel, I'd like to help…

andrewldesousa updated 2 months ago
1
Mozilla-Ocho/llamafile #244

Should I increase the KV cache size or reduce n_batch?

Error: ``` [1707364943] update_slots : failed to find free space in the KV cache, retrying with smaller n_batch = 256 [1707364943] update_slots : failed to find free space in the KV cache, retryi…

caol64 updated 4 months ago
4
Lightning-AI/litgpt #1237

LongLora fine-tuning support

[LongLora](https://arxiv.org/abs/2309.12307) is "an efficient fine-tuning approach that extends the context sizes of pre-trained large language models". They propose to fine-tune a model with a sparse…

belerico updated 5 months ago
5

上一页 1...67 68 69 70 71 72 73...100 下一页

1000+ results for mistralai

1000+ results
for mistralai