-
![image](https://github.com/stanfordnlp/dspy/assets/48504366/a1d22ed8-1f6c-45be-bd85-6a9449c9efc0)
![image](https://github.com/stanfordnlp/dspy/assets/48504366/f5ff917d-993f-4c38-9043-700fe2597274)
…
-
This issue occurs with the llama2 fp16 and int4 weight models, as well as with a trimmed model that returns immediately after the first GQA node.
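For reproduction, a minimal sketch of how such a trimmed model can be produced with `onnx.utils.extract_model`, assuming the model is an ONNX graph containing a `GroupQueryAttention` node; the file names and graph input names below are placeholders, not the exact artifacts from this issue:

```python
import onnx

# Load the graph (without external weight data) to find the first GQA node's output name.
model = onnx.load("llama2_fp16.onnx", load_external_data=False)
gqa_output = next(
    node.output[0]
    for node in model.graph.node
    if node.op_type == "GroupQueryAttention"
)

# Extract a subgraph that stops right after that node so it can be run in isolation.
# Input/output tensor names are placeholders for this particular export.
onnx.utils.extract_model(
    "llama2_fp16.onnx",
    "llama2_fp16_trimmed.onnx",
    input_names=["input_ids", "attention_mask"],
    output_names=[gqa_output],
)
```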
-
### 🐛 Describe the bug
Currently I'm trying to test the LLaMA 3.2 3B Instruct Model as you guided,
but I faced some issues during .pte generation for the LLaMA 3.2 3B Instruct Model with QNN on the On Device sid…
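For context, a minimal sketch of the generic ExecuTorch export-to-`.pte` flow, without the QNN delegate step where the issue occurs; the checkpoint, example inputs, and output path are placeholders rather than the exact commands from the guide:

```python
import torch
from executorch.exir import to_edge
from transformers import AutoModelForCausalLM

# Placeholder checkpoint; the real flow uses the Llama 3.2 3B Instruct weights.
model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-3.2-3B-Instruct")
model.eval()

# Fixed-shape dummy input_ids used only to drive the export.
example_inputs = (torch.randint(0, 128000, (1, 32)),)

# Export to an ExportedProgram, lower to the edge dialect, and serialize to .pte.
# The QNN backend additionally requires partitioning with its own partitioner,
# which is omitted from this simplified sketch.
exported = torch.export.export(model, example_inputs)
edge = to_edge(exported)
executorch_program = edge.to_executorch()

with open("llama3_2_3b_instruct.pte", "wb") as f:
    f.write(executorch_program.buffer)
```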
-
Command:
```
python main.py --model /data/Llama-2-7b-chat-hf/ --prune_method wanda --sparsity_ratio 0.5 --sparsity_type unstructured --save out/llama_2_7b/unstructured/wanda/
…
```
-
Hi all,
I was following the [tutorial here](https://github.com/aws-neuron/aws-neuron-sdk/blob/master/src/examples/pytorch/neuronx_distributed/llama/llama2_inference.ipynb) to run the trace on llama2-7B.…
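For reference, a minimal sketch of the kind of trace step involved, using `torch_neuronx.trace` on a single core; the linked notebook uses `neuronx_distributed`'s parallel tracing utilities for the sharded model, so this is only a simplified illustration and the checkpoint and input shapes are placeholders:

```python
import torch
import torch_neuronx
from transformers import AutoModelForCausalLM

# Placeholder: the notebook shards Llama-2-7B with neuronx_distributed before tracing.
model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf", torch_dtype=torch.bfloat16
)
model.eval()

# Fixed-shape example inputs; Neuron compilation specializes to these shapes.
input_ids = torch.zeros((1, 128), dtype=torch.long)
attention_mask = torch.ones((1, 128), dtype=torch.long)

# Compile the forward pass to a Neuron executable and save the traced module.
traced = torch_neuronx.trace(model, (input_ids, attention_mask))
torch.jit.save(traced, "llama2_7b_neuron.pt")
```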
-
![image](https://github.com/user-attachments/assets/bcdc2387-eb0a-4aca-a4c4-a07d755a8bac)
-
Hi, thanks for your great work! When I used the EAGLE-llama2-chat-7B model you provided for testing, the average acceptance length I measured was lower than the value in the paper. The way I obtained it was …
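Since the measurement procedure is cut off above, here is one common way to compute the average acceptance length (accepted draft tokens per verification step plus the one bonus token); this is an assumed illustration, not the exact script used:

```python
def average_acceptance_length(accepted_per_step):
    """Mean number of tokens committed per target-model verification step.

    `accepted_per_step` holds, for each speculative decoding step, the number of
    draft tokens accepted by the target model; each step also commits one extra
    token sampled from the target model, hence the +1.
    """
    return sum(n + 1 for n in accepted_per_step) / len(accepted_per_step)

# Hypothetical per-step acceptance counts collected during decoding.
print(average_acceptance_length([3, 4, 2, 5, 4]))  # 4.6
```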
-
I tested the Speculative Sampling method with llama2-7b and llama2-70b on an A800, but the speedup was almost zero, and negative in most cases.
llama2-7b base 103.25 tokens/s
llama2-7b …
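For clarity on how the (near-zero or negative) gain is being judged, a small sketch of the throughput comparison; the numbers below are placeholders except for the 103.25 tokens/s baseline quoted above:

```python
def speedup(spec_tokens_per_s, base_tokens_per_s):
    """Relative throughput change of speculative sampling over plain decoding."""
    return spec_tokens_per_s / base_tokens_per_s - 1.0

# llama2-7b baseline from the measurements above; the speculative figure is hypothetical.
base = 103.25
spec = 101.0  # assumed measured tokens/s with speculative sampling enabled
print(f"speedup: {speedup(spec, base):+.1%}")  # negative here, i.e. a slowdown
```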
-
Currently, `evaluation.yaml` exists under the `configs/` directory. To start, we wanted to just showcase this recipe as an example, but it is a core part of the finetuning process and therefore shou…
-
### System Info
---
**Setup Summary for LoRAX Benchmarking with Llama-2 Model:**
- **Hardware**: A100 40 GB (a2-highgpu-2g) on Google Kubernetes Engine (GKE)
- **Image**: ghcr.io/predibase…