-
Is there any plan to restructure the code so it can be used uniformly with Llama2 or with API models (gpt-3.5-turbo, gpt-4), so that this PDF-to-text tool can run on any hardware?
https://github.com/Dicklesworthstone/llama2_ai…
-
llama.cpp: loading model from models\llama-2-7b-chat.ggmlv3.q8_0.bin
error loading model: unknown (magic, version) combination: 67676a74, 00000003; is this really a GGML file?
llama_init_from_file…
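The magic `67676a74` in the error above is the `ggjt` container (the versioned GGML format that predates GGUF), so the failure means this llama.cpp build does not recognize a `ggjt` version-3 file. As a minimal sketch, the header can be inspected directly; the `identify_model_file` helper below is hypothetical, not part of llama.cpp, and assumes the magic is read as a little-endian uint32 the way llama.cpp does:

```python
import struct

# Known llama.cpp model-file magics (hypothetical lookup table for illustration).
MAGICS = {
    0x67676D6C: "ggml (unversioned legacy format)",
    0x67676A74: "ggjt (versioned GGML, pre-GGUF)",
    0x46554747: "gguf (current llama.cpp format)",
}

def identify_model_file(path):
    """Read the first 4 bytes as a little-endian uint32 and look up the format."""
    with open(path, "rb") as f:
        (magic,) = struct.unpack("<I", f.read(4))
    return MAGICS.get(magic, f"unknown magic 0x{magic:08x}")
```

Running this on `llama-2-7b-chat.ggmlv3.q8_0.bin` should report the `ggjt` format, confirming the file itself is well-formed and the mismatch is between the file's format version and what the installed llama.cpp build supports.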
-
Feature request
Nous Research and EleutherAI have released YaRN, which comes in two versions with context sizes of 64k and 128k. The model uses RoFormer-style (rotary) embeddings, distinguishin…
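For context, the RoFormer-style (rotary) embeddings mentioned above rotate each pair of channels by an angle that grows with position, so attention scores depend only on relative position. Below is a minimal NumPy sketch of the split-half variant used by GPT-NeoX/Llama-style models; YaRN itself additionally rescales the per-frequency angles to extend the context window, which is not shown here:

```python
import numpy as np

def rope(x, base=10000.0):
    """Apply rotary position embeddings to x of shape (seq_len, dim).

    Channel i in the first half is paired with channel i in the second half,
    and each pair is rotated by position * base**(-i / (dim/2)).
    """
    seq_len, dim = x.shape
    half = dim // 2
    inv_freq = base ** (-np.arange(half) / half)          # (half,)
    angles = np.outer(np.arange(seq_len), inv_freq)       # (seq_len, half)
    cos, sin = np.cos(angles), np.sin(angles)
    x1, x2 = x[:, :half], x[:, half:]
    # 2-D rotation applied to each (x1_i, x2_i) pair.
    return np.concatenate([x1 * cos - x2 * sin, x1 * sin + x2 * cos], axis=-1)
```

The rotation preserves vector norms, and the dot product between a rotated query at position m and a rotated key at position n depends only on m − n, which is the property context-extension methods like YaRN manipulate.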
-
Loading checkpoint shards: 100%|████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████████| 2/2 [00:42
-
When will Megatron-DeepSpeed support Llama3/Llama3.1 pretraining?
-
Hello. Can you please tell me which evolutionary search hyperparameters (population_size, mutation_numbers, crossover_size, etc.) you used for the 8x context-length extension of Mistral v0.1 or LLaM…
-
## Environment
- RTX8000 GPU
- TensorRT-LLM v0.9.0
## Model
- LLaVA v1.5 7B (LLaMA2 7B)
- fp16 and int8/int4 weight quantization
- batch size = 16
## Script
- official `examples/multimodal/run.…
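Since the setup above mentions int8/int4 weight quantization, here is a minimal NumPy illustration of symmetric per-channel int8 weight quantization. This shows the basic idea only; it is not TensorRT-LLM's implementation, and the function names are ours:

```python
import numpy as np

def quantize_int8(w):
    """Symmetric per-output-channel int8 quantization of a weight matrix.

    Each row is scaled so its largest magnitude maps to 127, rounded to
    int8, and the per-row scale is kept for dequantization.
    """
    scale = np.abs(w).max(axis=1, keepdims=True) / 127.0
    scale = np.where(scale == 0, 1.0, scale)          # avoid divide-by-zero on all-zero rows
    q = np.clip(np.round(w / scale), -127, 127).astype(np.int8)
    return q, scale

def dequantize(q, scale):
    """Recover an approximate float weight matrix from int8 values and scales."""
    return q.astype(np.float32) * scale
```

The reconstruction error per element is bounded by half a quantization step (scale / 2), which is why per-channel scaling typically costs little accuracy at int8 while halving weight memory versus fp16.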
-
Hello! Did anyone hit the following bug when using ZeRO stage 3 (zero_stage3) for Llama2?
```
step3_rlhf_finetuning/rlhf_engine.py:61 in __init__ │
│ …
```
-
# 1. Ollama
## 1. Use the Ollama CLI:
```
ollama serve
ollama run llama2:7b   # one model per run; also e.g. llama2:13b, llama2:70b, llama3, llama3:70b, mistral, dolphin-phi, phi, neural-chat, codellama
ollama list
ollama show
…
-
## 🐛 Bug
## To Reproduce
Steps to reproduce the behavior:
I followed [this Captum tutorial](https://captum.ai/tutorials/Llama2_LLM_Attribution).
My code is here; the only difference is that I changed the model_…