-
### 🚀 The feature, motivation and pitch
Currently, there is a [parallel_tool_calls](https://github.com/vllm-project/vllm/blob/18b296fdb2248e8a65bf005e7193ebd523b875b6/vllm/entrypoints/openai/protocol…
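For context, this is how a client would pass the field to an OpenAI-compatible `/v1/chat/completions` endpoint. A minimal sketch of the request payload, assuming the field name follows the OpenAI API (the tool name and model are illustrative only; whether vLLM honours the flag is exactly what this issue is about):

```python
# Sketch of a chat-completions request body that sets `parallel_tool_calls`.
# The `get_weather` tool and the model name are hypothetical examples.
payload = {
    "model": "meta-llama/Meta-Llama-3-8B-Instruct",
    "messages": [
        {"role": "user", "content": "What is the weather in Paris and Tokyo?"}
    ],
    "tools": [
        {
            "type": "function",
            "function": {
                "name": "get_weather",
                "parameters": {
                    "type": "object",
                    "properties": {"city": {"type": "string"}},
                    "required": ["city"],
                },
            },
        }
    ],
    # Ask the server to emit at most one tool call per assistant turn.
    "parallel_tool_calls": False,
}
```

The question here is whether the server actually respects the field or silently ignores it.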
-
I submitted a request for access and obtained a key from the following URL: [https://llama.meta.com/llama-downloads/](https://llama.meta.com/llama-downloads/)
The instructions for downloading refer t…
-
I tried these two quantization approaches:
```
model_path = '/home/catid/models/Meta-Llama-3-70B-Instruct'
quant_path = 'cat-llama-3-70b-q128-w4-gemvfast'
quant_config = { "zero_point": True, "q…
```
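For reference, a full AutoAWQ quantization run following the AutoAWQ README might look like the sketch below. The paths are taken from the snippet above; the `quant_config` keys after `"zero_point"` are assumptions (the original is truncated), inferred from the `q128-w4-gemvfast` output directory name:

```python
import os

# Paths from the issue; the config values are assumptions reconstructed
# from the output directory name "cat-llama-3-70b-q128-w4-gemvfast".
model_path = "/home/catid/models/Meta-Llama-3-70B-Instruct"
quant_path = "cat-llama-3-70b-q128-w4-gemvfast"
quant_config = {
    "zero_point": True,
    "q_group_size": 128,    # assumed: "q128" in the output name
    "w_bit": 4,             # assumed: "w4" in the output name
    "version": "gemv_fast", # assumed: "gemvfast" in the output name
}

# The heavy steps only run if the checkpoint actually exists locally.
if os.path.isdir(model_path):
    from awq import AutoAWQForCausalLM
    from transformers import AutoTokenizer

    model = AutoAWQForCausalLM.from_pretrained(model_path)
    tokenizer = AutoTokenizer.from_pretrained(model_path)
    model.quantize(tokenizer, quant_config=quant_config)
    model.save_quantized(quant_path)
    tokenizer.save_pretrained(quant_path)
```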
-
hi,
The model meta-llama/Meta-Llama-3-8B-Instruct is not listed; not sure when it will be supported?
https://github.com/huggingface/chat-ui/blob/3d83131e5d03e8942f9978bf595a7caca5e2b3cd/.env.templa…
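For context, chat-ui picks up models from the `MODELS` JSON array in its env file. A minimal hedged entry for this model might look like the fragment below (field names beyond `name` are assumptions based on the linked `.env.template`):

```json
[
  {
    "name": "meta-llama/Meta-Llama-3-8B-Instruct",
    "description": "Llama 3 8B Instruct",
    "parameters": {
      "temperature": 0.6,
      "max_new_tokens": 1024
    }
  }
]
```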
-
### Your current environment
The output of `python collect_env.py`
```text
Collecting environment information...
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch…
-
### System Info
WARNING:root:A model is loaded from './saved_models/fp-meta-llama3', and no v_head weight is found.
### Information
- [ ] The official example scripts
- [X] My own modified scripts…
-
When I try to reproduce the result following the instructions in the README, I get the following result on TruthfulQA for Llama-2-7b: AUROC is **60.36**, which is far from the **78.64** in Table 1. The full o…
-
### Question Validation
- [X] I have searched both the documentation and discord for an answer.
### Question
I have the code below and it runs well. However when I inspect the 'response' variable t…
-
Hello everyone, I have a problem and would like to ask for help. After I compile and run the inference code run.py, if I set max_output_len to a small value, the output will be truncated before it is …
-
### Reminder
- [X] I have read the README and searched the existing issues.
### System Info
The latest LLaMA-Factory repo (12 Sept 2024) forces the use of Torch 2.4, which clashes with Unsloth/XFormers
##…