-
### What happened?
llama.cpp produces garbled output on the 310P3 when using Qwen2.5-7b-f16.gguf
### Name and Version
./build/bin/llama-cli -m Qwen2.5-7b-f16.gguf -p "who are you" -ngl 32 -fa
### What operating system are you seeing the …
-
I've noticed that the FlashInfer prefill kernel is much slower than FA3 and TRT-LLM FMHA on SM90.
Do you have any plans to use SM90-specific features for optimization?
Here is some data I tested on an SM9…
-
./llama2.soc llama2-7b_int8_1dev.bmodel
Demo for LLama2-7B in BM1684X
Init Environment ...
Load tokenizer.model ... Done!
Device [ 0 ] loading ....
[BMRT][bmcpu_setup:406] INFO:cpu_lib 'libcpuop…
-
I installed Llama 2 and Llama 3 through Ollama on Windows; Danswer is also installed on Windows.
![image](https://github.com/danswer-ai/danswer/assets/106233935/6b2a3594-dd52-40e9-8dd7-74530d384ffe)
![image](…
-
How can I replace the OpenAI API with Llama2?
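A common approach is to keep the OpenAI client and point it at a local OpenAI-compatible server hosting Llama 2. Here is a minimal sketch assuming an Ollama instance at its default address; the `base_url` and model name are assumptions to adjust for your own setup (vLLM and other servers expose a similar `/v1` endpoint).
```python
from openai import OpenAI

# Point the standard OpenAI client at a local OpenAI-compatible server.
# base_url and model are assumptions: Ollama's default endpoint and its
# "llama2" model tag; change both to match the server you actually run.
client = OpenAI(
    base_url="http://localhost:11434/v1",
    api_key="not-needed-locally",  # the client requires a value; local servers ignore it
)

response = client.chat.completions.create(
    model="llama2",
    messages=[{"role": "user", "content": "Say hello."}],
)
print(response.choices[0].message.content)
```
Because only the `base_url`, `api_key`, and model name change, the rest of an existing OpenAI-based codebase can stay untouched.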
-
I am using a vLLM endpoint with the OpenAI API to send concurrent requests to a Llama2-7B model deployed on a single A100 GPU. Regardless of the values I set for `--block-size`, `--swap-space`, `--m…
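For reference, here is a minimal sketch of the concurrent-request pattern described above, assuming vLLM's OpenAI-compatible server is listening on `localhost:8000`; the model name, prompt set, and request count are placeholders, not the reporter's actual workload.
```python
import asyncio
from openai import AsyncOpenAI

# vLLM's OpenAI-compatible server; the model name must match the --model
# argument the server was started with (assumed here).
client = AsyncOpenAI(base_url="http://localhost:8000/v1", api_key="EMPTY")

async def one_request(prompt: str) -> str:
    resp = await client.chat.completions.create(
        model="meta-llama/Llama-2-7b-chat-hf",
        messages=[{"role": "user", "content": prompt}],
        max_tokens=128,
    )
    return resp.choices[0].message.content

async def main() -> None:
    # Fire 32 requests concurrently so the server can batch them.
    prompts = [f"Question {i}: what is {i} squared?" for i in range(32)]
    answers = await asyncio.gather(*(one_request(p) for p in prompts))
    print(len(answers), "responses received")

asyncio.run(main())
```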
-
[https://github.com/pytorch/executorch/blob/main/examples/models/llama2/README.md#performance](https://github.com/pytorch/executorch/blob/main/examples/models/llama2/README.md#performance)
AFAIK, Q…
-
Which LLaMA version is used here, LLaMA 1 or LLaMA 2?
-
I've trained xlora with a Mistral 7B base model, and it works fine. However, when switching the base model to Llama 2 7B, I encounter an error.
This is my training code:
```
model = AutoModelForCausa…
```
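The training code above is truncated, so the following is only a sketch of the base-model swap, assuming the standard Hugging Face Llama 2 checkpoint; one difference worth checking when moving from another base model is that Llama 2's tokenizer ships without a pad token, which can break a training loop that pads batches.
```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

# Assumed checkpoint name; substitute the Llama 2 7B checkpoint you actually use.
base_id = "meta-llama/Llama-2-7b-hf"

tokenizer = AutoTokenizer.from_pretrained(base_id)
# Llama 2's tokenizer has no pad token by default; fall back to EOS so that
# padded batches don't raise during training.
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token

model = AutoModelForCausalLM.from_pretrained(
    base_id,
    torch_dtype=torch.bfloat16,
    device_map="auto",
)
```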
-
### 🚀 The feature, motivation and pitch
I trained the current code with FSDP to fully fine-tune Llama2; it is very quick, but it turns out the performance is even worse than LoRA fine-tuned models u…
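The fine-tuning script itself is elided above, so the following is only a sketch of the FSDP wrapping pattern being described, assuming the Hugging Face Llama 2 checkpoint and a `torchrun` launch; it is not the author's actual code.
```python
import functools
import torch
import torch.distributed as dist
from torch.distributed.fsdp import FullyShardedDataParallel as FSDP
from torch.distributed.fsdp.wrap import transformer_auto_wrap_policy
from transformers import AutoModelForCausalLM
from transformers.models.llama.modeling_llama import LlamaDecoderLayer

# Assumes a torchrun launch; each rank joins the default process group.
dist.init_process_group("nccl")
torch.cuda.set_device(dist.get_rank() % torch.cuda.device_count())

model = AutoModelForCausalLM.from_pretrained(
    "meta-llama/Llama-2-7b-hf",  # assumed checkpoint name
    torch_dtype=torch.bfloat16,
)

# Shard at the decoder-layer boundary, the usual granularity for Llama models.
wrap_policy = functools.partial(
    transformer_auto_wrap_policy,
    transformer_layer_cls={LlamaDecoderLayer},
)
model = FSDP(
    model,
    auto_wrap_policy=wrap_policy,
    device_id=torch.cuda.current_device(),
)
# ...optimizer, data loading, and the training loop would follow here.
```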