mistral-large Search Results

1000+ results
for mistral-large

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

huggingface/text-generation-inference #2096

P40 with USE_FLASH_ATTENTION=False

### System Info Linux k8s-node2 6.5.0-41-generic #41~22.04.2-Ubuntu SMP PREEMPT_DYNAMIC Mon Jun 3 11:32:55 UTC 2 x86_64 x86_64 x86_64 GNU/Linux nvcc: NVIDIA (R) Cuda compiler driver Copyright (c…

ltm920716 updated 4 months ago
2
Jeffser/Alpaca #139

Alpaca uses my CPU instead of my GPU (AMD)

I have noticed that Alpaca uses my CPU instead of my GPU. Here's a screenshot showing how it's using almost 40% of my CPU, and only 1% of my GPU. ![Captura desde 2024-07-10 06-51-39](https://github…

frandavid100 updated 1 week ago
110
yufeikang/raycast_api_proxy #43

Country, region, or territory not supported

I tried using the openai base url locally(via curl) and it worked. this is the log: ```info raycast | 2024-07-14 11:10:11,112 MainThread main.py :72 INFO : Received request to /api/v1/me …

littleblack111 updated 4 months ago
7
danny-avila/LibreChat #3229

[Bug]: Remote hosting with nginx as reserve proxy with SSL. …

### What happened? I'm setting up librechat on a remote hosting with nginx. The register and interface is working fine. I've followed the tutorial https://www.librechat.ai/docs/remote/nginx and im do…

citric-gabriel updated 4 months ago
1
Yannael/multilingual-embeddings #1

How the value of 'pooling_type' is determined?

For each model, how is the value (`cls`, `mean`, `last_token`) of `pooling_type` determined? ```python embeddings_model_spec = { } embeddings_model_spec['E5-mistral-7b']={'model_name':'intfloat/…

LeMoussel updated 6 months ago
1
langchain-ai/langchain #17906

Vllm not able to set max_model_len

### Checked other resources - [X] I added a very descriptive title to this issue. - [X] I searched the LangChain documentation with the integrated search. - [X] I used the GitHub search to find a sim…

shivrajjadhav733 updated 5 months ago
2
vllm-project/vllm #4909

[Misc]: can vllm support long content inference like 800k

### Anything you want to discuss about vllm. we will finetune a 70B model that support long content with 800k, can vllm support to inference this model?

yunll updated 6 months ago
1
severian42/GraphRAG-Local-UI #6

The *_final*.parquet files are not being created

I've downloaded the latest code and run the indexing. The *_final*.parquet files are not being created in the output/artifacts directory. I ran the GraphRAG from command line using Microsoft git re…

KannamSridharKumar updated 4 months ago
20
huggingface/text-embeddings-inference #205

Support for SFR-Embedding Mistral

### Model description It would have been awesome if TEI supports SFR-Embedding-Mistral, which figures on the top of the mteb : https://huggingface.co/Salesforce/SFR-Embedding-Mistral ### Open source…

prasannakrish97 updated 4 months ago
8
michaelfeil/infinity #136

Content-Encoding: gzip

I wonder if it would make sense to support compressed requests, esp. for /rerank, where the query and document list could be many 1k or 2k chunks of text? The incoming request could easily exceed 20 …

andrew-at-rise updated 2 months ago
7

上一页 1...94 95 96 97 98 99 100...100 下一页

1000+ results for mistral-large

1000+ results
for mistral-large