-
### System Info
A100 40GB
RAM 32GB
### Who can help?
@ArthurZucker, @younesbelkada
### Information
- [x] The official example scripts
- [ ] My own modified scripts
### Tasks
- [ ]…
-
### What happened?
I can't make json_mode work with DeepInfra through LiteLLM, although it works fine when I call DeepInfra directly via the requests library.
Below is a small snippet to reproduce t…
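The original reproduction snippet is truncated above. As a minimal sketch of the direct-requests path that reportedly works, this builds a request body for an OpenAI-compatible chat endpoint with `response_format` set to JSON mode. The endpoint URL, model name, and prompt below are illustrative assumptions, not taken from the report:

```python
import json

# Illustrative endpoint for DeepInfra's OpenAI-compatible API (assumption).
API_URL = "https://api.deepinfra.com/v1/openai/chat/completions"

def build_json_mode_payload(model: str, prompt: str) -> dict:
    """Build a chat-completions request body that asks for JSON-only output."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        # json_mode: the OpenAI-compatible way to request structured output
        "response_format": {"type": "json_object"},
    }

payload = build_json_mode_payload(
    "mistralai/Mistral-7B-Instruct-v0.3",  # hypothetical model id
    "Reply with a JSON object containing a single key 'answer'.",
)
body = json.dumps(payload)
# A real call would then be:
# requests.post(API_URL, headers={"Authorization": f"Bearer {TOKEN}"}, data=body)
```

When routing the same request through LiteLLM instead, the equivalent would be passing `response_format={"type": "json_object"}` to the completion call; comparing the wire payloads of the two paths is one way to narrow down where json_mode gets dropped.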
-
### Problem Description
Hi,
When doing text generation with Mistral 7B using Hugging Face transformers on an MI100 GPU, I can see in the collected torch trace that a lot of time is wasted due to a hipMem…
-
Hi All,
I am trying to run an inference server on Ollama using the command below:
`ollama run mistral:v0.3`
Then I launch h2oGPT with:
`python generate.py --guest_name='' --b…`
-
**Describe the bug**
I tried running DeepSpeed ZeRO-3 on a new Hugging Face model and got the following error:
[2023-12-13 04:12:18,837] [WARNING] [parameter_offload.py:86:_apply_to_tenso…
-
Dear authors of VideoLLaMA2,
Thanks for the great work. We tried to reproduce your results on vllava datasets using the latest version of the code. However, we observe a large discrepancy in the thre…
-
# Expected Behavior
I use llama-cpp-python on a non-GPU system and on an AMD GPU 6650 on Linux (Pop!_OS 22.04). This report is for the AMD GPU system. The non-GPU system outputs results fine. The AMD G…
-
**Problem**
Request from user:
https://build.nvidia.com/explore/discover
So this is something I came…
-
The model list I get is:
{
"models": [
{
"datasetName": null,
"datasetUrl": null,
"description": "Command R+ is Cohere's latest LLM and is the first op…
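Assuming the full response keeps the shape shown in the excerpt above (a top-level `"models"` array of objects carrying a `"description"` field), the descriptions can be pulled out with a short sketch; the sample response here is trimmed to the fields visible in the excerpt and the description is shortened, so it is illustrative only:

```python
import json

# Illustrative response, reduced to the fields shown in the excerpt above.
raw = """
{
  "models": [
    {
      "datasetName": null,
      "datasetUrl": null,
      "description": "Command R+ is Cohere's latest LLM"
    }
  ]
}
"""

data = json.loads(raw)
# Skip entries whose description is null, as other fields here are.
descriptions = [m["description"] for m in data["models"] if m.get("description")]
print(descriptions[0])
```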
-
Hi team,
Thank you for the great work. I tried to replicate the part that uses Mistral as the planner, and I noticed that in [tool_agent.py](https://github.com/OSU-NLP-Group/TravelPlanner/blob/main/agen…