-
### Your current environment
The output of `python collect_env.py`
```text
Collecting environment information...
PyTorch version: 2.4.0+cu121
Is debug build: False
CUDA used to build PyTor…
-
Running the model's forward pass within a process seems to get stuck. I tried setting `TOKENIZERS_PARALLELISM` to both `true` and `false`, but unfortunately neither helped 🥲
### System Info
`transformers-cli…
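For reference (this snippet is not part of the original report), the variable is typically set in code before the tokenizer is first used, rather than in the shell; a minimal sketch:

```python
import os

# Disable the Rust tokenizers' internal thread pool. This must be set
# before the tokenizer backend is first used; it is the usual mitigation
# for hangs/deadlocks when a process forks after tokenization has run.
os.environ["TOKENIZERS_PARALLELISM"] = "false"

# ... then import and use the tokenizer as usual, e.g.:
# from transformers import AutoTokenizer
# tokenizer = AutoTokenizer.from_pretrained("gpt2")
```

If the hang persists with parallelism disabled, the cause is likely elsewhere (e.g. a fork after CUDA initialization) rather than the tokenizer.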
-
It seems there is a bug in the `use_mem_eff_path` feature when `ngroups` is greater than 1. The loss curve initially decreases but then stabilizes around a constant value and fails to co…
-
### What happened + What you expected to happen
```python
@pytest.mark.parametrize("ray_start_cluster_head_with_env_vars", [
{
"include_dashboard": True,
"env_va…
-
The example should show tensor parallelism. I am not sure if Serve + vLLM + tensor parallelism works at the moment because the Serve deployment will request N GPUs, then each vLLM worker will request …
-
### Proposal to improve performance
_No response_
### Report of performance regression
_No response_
### Misc discussion on performance
Hi,
Thank you for your contribution to the LLM community…
-
## ❓ Question
## What you have already tried
## Environment
> Build information about Torch-TensorRT can be found by turning on debug messages
- PyTorch Version (e.g., 1.0):
- C…
-
### Your current environment
```text
Collecting environment information...
PyTorch version: 2.2.1+cu118
Is debug build: False
CUDA used to build PyTorch: 11.8
ROCM used to build PyTorch: N/A
…
-
I didn't see any documentation that mentions that.
-
In Megatron, I found the following check for `tp_comm_overlap` and `sequence_parallel`:
```python
if args.tp_comm_overlap:
assert args.sequence_parallel == True, 'Tensor parallel communicatio…