-
There is an error when trying to load the model.
The error is in the model itself: `checkpoint = torch.load(local_embedding_path, map_location="cpu")['weight']`
This apparently expects embed_ll…
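In case it helps debugging, here is a minimal sketch (the path is illustrative, and the key-fallback logic is an assumption about how the checkpoint might have been saved) that prints the checkpoint's top-level keys before indexing, which makes a `'weight'` vs. `embed_...` key mismatch visible:

```python
import torch

# Illustrative path; substitute the real embedding checkpoint.
local_embedding_path = "embeddings.pt"

# Load on CPU and inspect the top-level keys before indexing, since the
# expected key ('weight' vs. an 'embed_...'-prefixed name) depends on how
# the checkpoint was saved.
checkpoint = torch.load(local_embedding_path, map_location="cpu")
if isinstance(checkpoint, dict):
    print(list(checkpoint.keys()))
    # Fall back to the first entry if 'weight' is absent (assumption).
    weight = checkpoint.get("weight", next(iter(checkpoint.values())))
else:
    weight = checkpoint  # checkpoint was saved as a bare tensor
```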
-
Why do some outputs look like this:
```
Moviepy - Done !
Moviepy - video read…
```
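Those lines are MoviePy's own progress logger rather than an error. If the goal is to silence them, a minimal sketch (file names are illustrative) that passes `logger=None` to the write call:

```python
from moviepy.editor import VideoFileClip

# MoviePy emits the "Moviepy - ..." progress lines through its
# proglog-based logger; logger=None suppresses them.
clip = VideoFileClip("input.mp4")
clip.write_videofile("output.mp4", logger=None)
```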
-
I have tried your llama example; the output is **random** and it took 770 seconds to finish:
**Command:**
```
python src/run_generation.py --model_type llama --model_name_or_path meta-llama/Ll…
```
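One common reason generated text looks random is the decoding configuration rather than the weights themselves. As a hedged sketch (the model id and prompt are illustrative, and meta-llama checkpoints require access approval), forcing greedy decoding with `transformers` can rule sampling out:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Illustrative model id; substitute the checkpoint used in the report.
model_id = "meta-llama/Llama-2-7b-hf"
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id, torch_dtype=torch.float16, device_map="auto"
)

inputs = tokenizer("The capital of France is", return_tensors="pt").to(model.device)
# do_sample=False forces deterministic greedy decoding; sampled decoding
# (do_sample=True with a high temperature) can look "random".
output = model.generate(**inputs, max_new_tokens=32, do_sample=False)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```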
-
Hello, @b4rtaz!
I'm trying to run the model [nkpz/llama2-22b-chat-wizard-uncensored](https://huggingface.co/nkpz/llama2-22b-chat-wizard-uncensored) on a cluster composed of 1 Raspberry Pi 4B 8 GB and 7…
-
Hi,
I am trying to convert the Llama 2 7B model with the script below:
python export_meta_llama_bin.py ~/projects/75_NLP/llama-main/llama-2-7b llama2_7b.bin
It always pops up a "killed" message.
My hardwa…
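A "killed" message during conversion usually means the Linux OOM killer stopped the process while the full checkpoint was being materialized in RAM. As a sketch of one workaround (this is not the project's export script, and the shard filename is an assumption about Meta's download layout), PyTorch 2.1+ can memory-map the checkpoint instead of reading it all into memory at once:

```python
import os
import torch

# Assumed path to one consolidated Llama 2 checkpoint shard.
ckpt_path = os.path.expanduser(
    "~/projects/75_NLP/llama-main/llama-2-7b/consolidated.00.pth"
)

# mmap=True (PyTorch >= 2.1) maps tensor storage from disk on demand
# instead of allocating the whole ~13 GB checkpoint in RAM up front,
# which is the usual trigger for the OOM killer's "Killed" message.
state_dict = torch.load(
    ckpt_path, map_location="cpu", mmap=True, weights_only=True
)
print(sum(t.numel() for t in state_dict.values()), "parameters mapped")
```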
-
### 📚 The doc issue
https://github.com/pytorch/executorch/tree/main/examples/qualcomm
- On the 3) qaihub_scripts, it still mentions llama2. Can we update those to reflect the latest support for l…
-
/kind feature
**Describe the solution you'd like**
To autoscale LLM inference services, Knative's request-level metrics may not be the best scaling signal, as LLM inference is performed at the toke…
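For illustration, one way a token-level signal could be exposed for an autoscaler to scrape is a custom Prometheus counter. Everything below (metric name, port, update cadence) is a hypothetical sketch, not an existing Knative integration:

```python
import time
from prometheus_client import Counter, start_http_server

# Hypothetical token-level metric an autoscaler could consume, in contrast
# to Knative's built-in request-level concurrency/RPS metrics.
GENERATED_TOKENS = Counter(
    "llm_generated_tokens_total", "Total tokens generated by this replica"
)

def record_generation(num_tokens: int) -> None:
    # Called once per decode step or per completed request.
    GENERATED_TOKENS.inc(num_tokens)

if __name__ == "__main__":
    start_http_server(8000)  # exposes /metrics for scraping
    while True:
        record_generation(16)  # stand-in for real decoding work
        time.sleep(1.0)
```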
-
Hello,
I can't run step 4 of the instructions available at https://github.com/pytorch/executorch/tree/main/examples/models/llama2
When I run step _2. Build llama runner._ I get an error…
-
Knowing that the Ollama server supports the OpenAI API ([https://ollama.com/blog/openai-compatibility](https://ollama.com/blog/openai-compatibility)), the goal is to **point Cursor to query the local Ollama se…
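Before pointing Cursor at it, the endpoint can be sanity-checked with the official `openai` Python client, as the linked post describes; the model name below is illustrative and must already be pulled locally:

```python
from openai import OpenAI

# Ollama exposes an OpenAI-compatible endpoint at /v1; the API key is
# required by the client but ignored by Ollama.
client = OpenAI(base_url="http://localhost:11434/v1", api_key="ollama")

# Illustrative model name; pull it first with `ollama pull llama2`.
response = client.chat.completions.create(
    model="llama2",
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
)
print(response.choices[0].message.content)
```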
-
### System Info
- CPU architecture: x86_64
- CPU/Host memory size: 250GB total
- GPU properties
  - GPU name: 2x NVIDIA A100 80GB
  - GPU memory size: 160GB total
- Libraries
  - tensorrt @ fi…