-
@mroch @li-boxuan @jeremi @penberg @JensRoland
Integrate a feature that allows users to use multiple LLM models in the project, each with its own area of expertise.
For example:
when a user adds 3…
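One way to realize this kind of feature is a simple expertise router that picks a model per query. The model names and the keyword heuristic below are purely illustrative assumptions, not part of any existing project API:

```python
# Hypothetical sketch: route each query to one of several LLMs by expertise.
# Model identifiers and keyword lists are illustrative assumptions.
EXPERTS = {
    "code": "model-a-code",
    "math": "model-b-math",
    "general": "model-c-general",
}

def route(query: str) -> str:
    """Pick a model id based on simple keyword heuristics."""
    lowered = query.lower()
    if any(k in lowered for k in ("def ", "class ", "bug", "compile")):
        return EXPERTS["code"]
    if any(k in lowered for k in ("integral", "solve", "equation")):
        return EXPERTS["math"]
    return EXPERTS["general"]
```

A production version would likely replace the keyword check with a classifier or a cheap LLM call, but the dispatch structure stays the same.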
-
### Describe your problem
I deployed ragflow (infiniflow/ragflow:v0.9.0) on AWS EKS. I have two nodes running all the dependencies (redis, mysql, minio, elasticsearch).
**Nodes detail:**
RAM: 64 GB…
-
### What is the issue?
Mixtral 8x22b instruct outputs are either empty or gibberish.
I have tried various quantizations: q4, q4_k_m, q5, etc. All seem problematic.
Other models (e.g., llama3, com…
-
### What is the issue?
```
llm_load_tensors: ggml ctx size = 0.13 MiB
llama_model_load: error loading model: done_getting_tensors: wrong number of tensors; expected 255, got 254
llama_load_model_fro…
```
-
Currently, TensorRT-LLM requires that the LoRA weights' dtype match the base model's dtype. The check is here:
https://github.com/NVIDIA/TensorRT-LLM/blob/9dbc5b38baba399c5517685ecc5b66f57a177a4c/cpp/tensor…
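Until that check is relaxed, a workaround is to cast the LoRA tensors to the base model's dtype before loading them. A minimal sketch using numpy arrays as stand-ins (a real TensorRT-LLM checkpoint would hold torch tensors, and the function name here is hypothetical):

```python
import numpy as np

def align_lora_dtype(lora_weights: dict, base_dtype) -> dict:
    """Cast every LoRA tensor to the base model's dtype (illustrative helper)."""
    return {name: w.astype(base_dtype) for name, w in lora_weights.items()}

# Example: fp32 LoRA weights cast to match an fp16 base model.
lora = {
    "lora_A": np.ones((4, 2), dtype=np.float32),
    "lora_B": np.zeros((2, 4), dtype=np.float32),
}
aligned = align_lora_dtype(lora, np.float16)
```

Note that casting fp32 adapters down to fp16/bf16 can lose precision, which is presumably why the strict check exists in the first place.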
-
Running make after cloning seems to result in the following:
```console
~/llama3.cuda$ make
nvcc -DUSE_CUBLAS=1 -g -o runcuda llama3.cu -lm -lcublas
/usr/include/c++/11/bits/std_function.h:435:145: …
-
```cpp
std::vector routes = {
    {
        "/v1/chat/completions",
        HttpMethod::METHOD_POST,
        std::bind(&handleCompletionsRequest, std::placeholders::_1, &api)
    },
    …
```
-
When I process Chinese text, this **randomly** happens during a query. I checked the response; the answer is actually there. I just can't understand why.
-
Opening a new issue for the previously opened issue here -- https://github.com/huggingface/tokenizers/issues/1517
Here we can see that the desired behavior for `return_offsets_mapping` from Mistral…
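For context, `return_offsets_mapping` is expected to give, for each token, the `(start, end)` character span in the original string. A self-contained sketch of that contract, using a hand-made token list rather than a real tokenizer (so the whitespace-split tokens here are an illustrative assumption):

```python
# Illustrates the (start, end) character-span contract of offset mappings.
text = "Hello world"
tokens = ["Hello", "world"]  # stand-in for a tokenizer's output

offsets = []
pos = 0
for tok in tokens:
    start = text.index(tok, pos)   # locate the token in the source text
    offsets.append((start, start + len(tok)))
    pos = start + len(tok)

# offsets == [(0, 5), (6, 11)]
```

The issue being reported is about whether the spans returned by the Mistral tokenizer follow this contract; the sketch only shows what the contract is.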
-
### Description
![Context 1](https://github.com/modelscope/modelscope-agent/assets/56472384/a1b21f26-04d7-420a-a3b2-a5085300f243)
![Context 2](https://github.com/modelscope/modelscope-agent/assets/56472384/32…