-
After running the default flow on Mistral in vLLM, there is a large (>100MB) report JSON in the directory where I ran the commands. This seems quite heavyweight, especially for a JSON file.
Instead, I …
mgoin updated 2 months ago
-
Hi, I'm trying to fine-tune the Llama 3.1 8B model. After fine-tuning I upload it to HF, and when I try to run it with vLLM I get this error: "KeyError: 'base_model.model.model.layers.0.mlp.dow…
-
Hello. Can you please tell me which evolutionary search hyperparameters (population_size, mutation_numbers, crossover_size, etc.) you used to achieve the 8x context-length increase for Mistral v0.1 or LLaM…
-
Hello and thank you for the great product.
I run into this problem when I try to use it with local Llama models.
At first it starts generating some code, and somewhere in the middle I receiv…
-
Hi
Using Ubuntu 22.
Both nvcc --version and nvidia-smi show valid output.
I've noticed that the GPU is not utilized when running larger models (e.g., Mixtral 8x7B, Llama 70B), …
-
There have been many discussions in the community regarding support for multiple models.
- ChatGPTNextWeb#3484
- ChatGPTNextWeb#3923
- ChatGPTNextWeb#960
- ChatGPTNextWeb#3431
- ChatGPTNextWeb#…
-
Llama 3.1
https://ai.meta.com/blog/meta-llama-3-1/
https://ai.meta.com/research/publications/the-llama-3-herd-of-models/
https://about.fb.com/news/2024/07/open-source-ai-is-the-path-forward/
A…
-
Hello, this is minimal Rust code comparable with llama.c; however, in terms of speed, how much slower is it compared with other pure-Rust libraries?
AFAIK, there are libraries such as mistral.rs that do almost the same thing.
-
**Is your feature request related to a problem? Please describe.**
We extend OpenAIChatGenerator for MistralChatGenerator. This works for chat completion but not for function calling. Mistral's funct…
-
## AAAI-24
Benchmarking Large Language Models in Retrieval-Augmented Generation
https://arxiv.org/abs/2309.01431
Hot or Cold? Adaptive Temperature Sampling for Code Generation with Large Langua…