-
I followed a [zero_to_hero_guide](https://github.com/meta-llama/llama-stack/blob/main/docs/zero_to_hero_guide/00_Inference101.ipynb) and am facing this issue for
```
llama_models==0.0.54
llama_sta…
-
![Screenshot 2024-11-10 000712](https://github.com/user-attachments/assets/fee095be-aa54-48c7-8626-28b733677e54)
---
![Screenshot 2024-11-10 020119](https://github.com/user-attachments/assets/67…
-
[TensorRT-LLM] TensorRT-LLM version: 0.13.0
0.13.0
^M0it [00:00, ?it/s]^M139it [00:00, 1375.80it/s]^M201it [00:00, 1554.11it/s]
[1729020016.135793] [toyota-tom-buddy-ml-vm:879 :0] ucp_context.c:1…
-
### 🐛 Describe the bug
Dear @shewu-quic
I followed your [instructions](https://github.com/pytorch/executorch/blob/main/examples/demo-apps/android/LlamaDemo/docs/delegates/qualcomm_README.md) to…
-
https://github.com/search?q=repo%3Apytorch%2Ftorchtune+decoder_lora&type=code
![image](https://github.com/user-attachments/assets/636e0880-c56a-4c0b-a925-4745bcaa883b)
-
### What happened?
Prior to PR #9921 / Version 4081 the -ngl 0 Q4_0 llama performance was significantly higher (more than 10x) than afterwards.
(hardware: Apple MacBook Air M2 10 GPU 24GB RAM)
…
-
### Checklist
- [X] 1. I have searched related issues but cannot get the expected help.
- [X] 2. The bug has not been fixed in the latest version.
- [X] 3. Please note that if the bug-related iss…
-
Id:0->1, Child: In the graph of $\frac{x^3-4x}{q(x)}*\frac{x^3+4x}{r(x)}$$,\ each $x$ such that $\sqrt{\frac{x^5-4x}{q^3x^2+4^3}\cdot, Policy: 1.0, Value: 0.5008712234296745
0%| …
-
llama 3.2 vision is a good work!
I am doing some interesting work based on llama 3.2 vision. I have read paper about llama 3.2 vision, but I have a very important question to ask.
Below is a image…
-
llama-stack install from source:https://github.com/meta-llama/llama-stack/tree/cherrypick-working
### System Info
python -m "torch.utils.collect_env"
/home/kaiwu/miniconda3/envs/llama/lib/pytho…