-
### Your current environment
The output of `python collect_env.py`
```text
PyTorch version: 2.4.0+cu121
OS: Ubuntu 22.04.3 LTS (x86_64)
Python version: 3.10.12 (main, Nov 20 2023, 15:14:05…
```
-
I am getting the following error when I try running `report_numerical_issues()` from the DiagnosticToolbox. I should note that this was working fine until I updated my WaterTAP environment--just haven…
-
I have a question: how can I parallelize your model using BERT?
-
The README contains the following warning:
"Note that the Anthropic API, llama-server (and ollama) currently does not support sampling multiple responses from a model, which limits the availa…
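Since each of these backends returns a single completion per request, one workaround is simply to issue n independent requests and collect the results. A minimal sketch, where `sample_n` and `generate` are hypothetical names, not part of any of the projects mentioned here:

```python
def sample_n(generate, n):
    """Collect n completions from a backend that returns one response per call.

    `generate` is any zero-argument callable that performs a single request
    (e.g. a thin wrapper around one Anthropic or llama-server call) and
    returns its completion text.
    """
    return [generate() for _ in range(n)]
```

Note the cost profile differs from true multi-sample support: this makes n full requests, so the prompt is re-processed server-side each time.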
-
### Prerequisites
- [X] I have read the [ServerlessLLM documentation](https://serverlessllm.github.io/).
- [X] I have searched the [Issue Tracker](https://github.com/ServerlessLLM/ServerlessLLM/issue…
-
### Description
As of astropy 5.3 I get the following error when building the `dkist` documentation with parallel jobs:
```
Process ForkProcess-19:
Traceback (most recent call last)…
```
-
### Description
```python
fitter = TRFLSQFitter(calc_uncertainties=True)
spice_model_fit = parallel_fit_dask(
data=spice,
model=average_fit,
fitter=fitter,
fitting_axes=0,
…
```
-
Hi,
Thanks for open-sourcing this amazing work. Is there a parameter to parallelize the model so it can run on smaller GPUs? I was not able to find one in the config. As suggested in the README, "we should turn on m…
-
### My Python Script
```python
import os

# Restrict this process to GPUs 2 and 3; must be set before importing torch/vLLM.
os.environ["CUDA_VISIBLE_DEVICES"] = "2,3"
import time
import torch
from vllm import LLM, SamplingParams
import gc
from vllm.model_executor.parallel_…
```
-
I deployed Qwen2-VL-72B using Swift, but during multi-image inference the generated results consistently terminate early. Could you advise on how to resolve this?
The startup script is as …