-
Following suggestions found online, I tried changing the transformers version, and also updated the version recorded in the model's config to match, but that did not solve the problem.
```json
{
  "architectures": [
    "TinyllmForCausalLM"
  ],
  "attention_dropout": 0.0,
  "hidden_act": "silu",
  "hidden_size": 512,
  "initializer…
```
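Since the report revolves around keeping the `transformers_version` field in the model's `config.json` in sync with the installed library, here is a minimal sketch of patching that field with the standard library. The version strings and the throwaway path are assumptions for illustration, not taken from the issue:

```python
import json
import tempfile
from pathlib import Path

# Demo on a throwaway config.json; in practice, point config_path at your
# model directory. Both version strings below are assumed, not from the issue.
tmp_dir = Path(tempfile.mkdtemp())
config_path = tmp_dir / "config.json"
config_path.write_text(json.dumps({
    "architectures": ["TinyllmForCausalLM"],
    "transformers_version": "4.38.0",  # assumed value saved with the checkpoint
}))

config = json.loads(config_path.read_text())
config["transformers_version"] = "4.40.0"  # assumed locally installed version
config_path.write_text(json.dumps(config, indent=2))

print(json.loads(config_path.read_text())["transformers_version"])  # → 4.40.0
```

Editing the field this way only changes metadata; if the architecture class (`TinyllmForCausalLM` here) is not registered in the installed transformers build, the load will still fail regardless of the version string.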
-
### Your current environment
Python==3.10.14
vllm==0.5.0.post1
ray==2.24.0
Node status
---------------------------------------------------------------
Active:
1 node_37c2b26800cc853721ef351c…
-
Hi,
I'm having an issue when trying to convert starcoder2-3b with SmoothQuant to TensorRT-LLM.
I'm running on an A100 40GB.
This is my command:
`python tensorrt_llm/examples/gpt/convert_checkpoint.py --mod…
-
GPU: 2 Arc cards
Running the following example:
[inference-ipex-llm](https://github.com/intel-analytics/ipex-llm/tree/main/python/llm/example/GPU/Pipeline-Parallel-Inference)
**for mistral and codell…
-
### Your current environment
Collecting environment information...
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
OS: Ubunt…
-
Trying to do inference on an Arc GPU machine; I have followed this guideline:
https://github.com/intel-analytics/ipex-llm/tree/main/python/llm/example/GPU/Pipeline-Parallel-Inference
and run_mi…
-
```
TypeError: Too few parameters for ; actual 2, expected 3
[2024-04-12 07:26:48,924] torch.distributed.elastic.multiprocessing.api: [ERROR] failed (exitcode: 1) local_rank: 0 (pid: 1263) of binary…
```
-
### System Info
- GPU name: L40s
- CUDA: 12.1
```
Wed Jun 5 16:27:21 2024
+-----------------------------------------------------------------------------------------+
| NVIDIA-SMI 550.54.14 …
```
-
Deployed with the v0.12.0 Docker image; the launch command is as follows:
sudo docker run -d -v /home/tskj/MOD/:/home/MOD/ -e XINFERENCE_HOME=/home/MOD -p 9997:9997 --gpus all xprobe/xinference:v0.12.0 xinference-local -H 0.0.0.0 --log-level de…
-
### Is there an existing issue / discussion for this?
- [X] I have searched the existing issues / discussions
### Is there an existing answer for this in the FAQ?