-
### What is the issue?
Ollama is failing to run on the GPU and falls back to the CPU instead. If I force it with `HSA_OVERRIDE_GFX_VERSION=9.0.0`, I get `Error: llama runner process has terminated: signal: abo…
-
With many claiming that phi3 mini is uncannily good for its size, and with larger, actually-useful phi3 models on the way, adding support for this arch is almost certainly worthwhile.
-
### What is the issue?
We are setting `OLLAMA_MAX_LOADED_MODELS=4` in our systemd override file for the ollama service:
![image](https://github.com/ollama/ollama/assets/48829375/b09c1dda-a196-4b89-b34…
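For context, the drop-in is roughly of this shape (a minimal sketch; the exact file path and value are specific to our setup, and `systemctl daemon-reload` plus a restart of the service is needed after editing):

```ini
# /etc/systemd/system/ollama.service.d/override.conf  (sketch of the drop-in)
[Service]
Environment="OLLAMA_MAX_LOADED_MODELS=4"
```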
-
Is it possible to implement MemGPT as a feature available to all agents, rather than as a separate agent as discussed in #530?
-
![error](https://github.com/user-attachments/assets/c6a351db-0074-4db7-bc68-9b6eb9f3081f)
After running the app.py file and putting the model in the web_app_storage/models folder, I get this er…
-
https://python.langchain.com/en/latest/use_cases/question_answering/semantic-search-over-chat.html
https://github.com/hwchase17/langchain/blob/master/docs/use_cases/question_answering/semantic-sear…
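Those pages follow the same pattern: split the chat log, embed the chunks into a vector store, then answer questions with a retrieval chain. Below is a minimal sketch using the legacy `langchain` imports those docs were written against (FAISS as the store, `RetrievalQA` for answering); module paths have since moved in newer releases, and `chat_history.txt` plus the OpenAI classes (which require an `OPENAI_API_KEY`) are placeholder assumptions:

```python
# Sketch: semantic search over a chat transcript with the legacy langchain API.
# Assumes OPENAI_API_KEY is set and chat_history.txt exists (both placeholders).
from langchain.text_splitter import CharacterTextSplitter
from langchain.embeddings.openai import OpenAIEmbeddings
from langchain.vectorstores import FAISS
from langchain.chains import RetrievalQA
from langchain.llms import OpenAI

# Load and split the chat transcript into chunks for embedding.
with open("chat_history.txt") as f:
    raw_chat = f.read()
splitter = CharacterTextSplitter(chunk_size=1000, chunk_overlap=100)
chunks = splitter.split_text(raw_chat)

# Embed the chunks and index them in FAISS for similarity search.
store = FAISS.from_texts(chunks, OpenAIEmbeddings())

# Wire the retriever into a question-answering chain.
qa = RetrievalQA.from_chain_type(
    llm=OpenAI(temperature=0),
    chain_type="stuff",
    retriever=store.as_retriever(search_kwargs={"k": 4}),
)
print(qa.run("Who suggested meeting on Friday?"))
```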
-
How can I use the ONNX model of Phi-3 mini 128k for faster inference on a local, CPU-only machine? Can you provide the code to do it?
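Not a definitive answer, but the CPU path generally goes through `onnxruntime-genai`, the package behind the phi3-qa.py example. A minimal sketch follows; the generator-loop method names differ slightly between package versions, and `model_dir` is a placeholder for wherever the cpu-int4 model folder was downloaded:

```python
# Sketch: CPU-only generation with the Phi-3 mini 128k ONNX model.
# Install with: pip install onnxruntime-genai   (CPU build)
# model_dir is a placeholder for the downloaded cpu-int4 model folder.
import onnxruntime_genai as og

model_dir = "cpu_and_mobile/cpu-int4-rtn-block-32"
model = og.Model(model_dir)
tokenizer = og.Tokenizer(model)
stream = tokenizer.create_stream()

# Phi-3 chat template around the user question.
prompt = "<|user|>\nWhat is the capital of France?<|end|>\n<|assistant|>\n"
params = og.GeneratorParams(model)
params.set_search_options(max_length=512)
params.input_ids = tokenizer.encode(prompt)

# Greedy decode loop, printing tokens as they are produced.
generator = og.Generator(model, params)
while not generator.is_done():
    generator.compute_logits()
    generator.generate_next_token()
    print(stream.decode(generator.get_next_tokens()[0]), end="", flush=True)
print()
```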
-
## Description:
Selecting one of the following models for the final response generation.
- Phi-3-mini (4k and 128k)
- Llama3-8B (8k)
- google/gemma-7b
## Criteria
- context length
- response time
…
-
I downloaded the phi3-mini-128k-instruct-onnx model (cpu_and_mobile/cpu-int4-rtn-blocks-32) from Hugging Face and used phi3-qa.py to run text generation, following the instructions in the [readme]…
-
I've been testing out phi3-128k, but I'm running into issues when using larger context windows (>4000 tokens).
With `cuda-fp16`, anything larger than 4096 gives me a memory allocation error, which is surprising …