-
### System Info
TGI version: latest; single NVIDIA GeForce RTX 3090.
### Information
- [X] Docker
- [ ] The CLI directly
### Tasks
- [X] An officially supported command
- [ ] My own modifications
…
-
### What happened?
I'm getting a register count overflow when trying to run llama3.1_405b_fp16 on 8 HIP devices targeting gfx942:
```
iree/runtime/src/iree/vm/bytecode/verifier.c:345: RESOURCE_EXHAUST…
```
-
### System Info
lorax_version=0.12.0
Using Docker to host the model: it runs perfectly for Llama3.1-8b, but with Llama3.2-11b I am getting the following error:
ModuleNotFoundError: No module…
-
Thanks for this interesting project.
I learned about this project while using Ollama. Since Ollama doesn't support log_prob, I was interested in trying Optillm.
I have been trying for the last fe…
-
```python
from edsl import Model
import time
models_list = [['Austism/chronos-hermes-13b-v2', 'deep_infra', 0], ['BAAI/bge-base-en-v1.5', 'together', 1], ['BAAI/bge-large-en-v1.5', 'together', …
```
-
The script I'm using is below; how can I convert it into a torchrun command?
```
NPROC_PER_NODE=8 CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 swift sft \
--model_type llava1_6-llama3_1-8b-instruct \
    --model_id_or_path .cache/modelscope/hub/swift/…
```
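A minimal sketch of one possible translation, assuming ms-swift exposes its fine-tuning entry point as the importable Python module `swift.cli.sft` (so it can be launched via torchrun's module mode) and that the elided training flags carry over unchanged:
```bash
# Sketch: run the same 8-GPU sft job under torchrun instead of the swift launcher.
# Assumes ms-swift's CLI entry point is importable as the module swift.cli.sft.
CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 \
torchrun --nproc_per_node 8 -m swift.cli.sft \
    --model_type llava1_6-llama3_1-8b-instruct \
    --model_id_or_path .cache/modelscope/hub/swift/…  # remaining flags as in the original script
```
Here `NPROC_PER_NODE=8` becomes torchrun's `--nproc_per_node 8`, while the flags after `sft` are passed through untouched.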
-
When running the notebook for inference using [Llama3](https://github.com/aws-neuron/aws-neuron-samples/blob/master/torch-neuronx/transformers-neuronx/inference/meta-llama-2-13b-sampling.ipynb):
```
…
```
-
Hello, I failed to convert Llama3.2 3B with TRT-LLM when I tried to run convert_checkpoint.py.
(like this link - https://github.com/NVIDIA/TensorRT-LLM/issues/2339)
I want to know if Llama3.2 3B model con…
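For context, the Llama conversion script in TensorRT-LLM's `examples/llama` directory is typically invoked roughly as in the sketch below; the checkpoint paths are placeholders, and the exact flag set can vary between TensorRT-LLM releases:
```bash
# Sketch: convert a Hugging Face Llama checkpoint into TensorRT-LLM's checkpoint format.
# ./Llama-3.2-3B-Instruct and ./tllm_ckpt_3b are placeholder paths, not from the report.
python examples/llama/convert_checkpoint.py \
    --model_dir ./Llama-3.2-3B-Instruct \
    --output_dir ./tllm_ckpt_3b \
    --dtype float16
```
Comparing against a baseline invocation like this can help isolate whether the failure comes from the arguments or from the model itself.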
-
Hi,
I want to work with the newly added model llama3_2_3b_instruct_q40, but I get an error when downloading the model in the Docker container. I checked launch.py and the issue is caused by this …