-
I noticed there are some settings related to tensor parallelism in `DeepSpeedEngine` and `PipelineEngine`. Could you please provide some examples of combining tensor parallelism with pipeline paralle…
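As background for the question, here is a minimal plain-Python sketch (not the DeepSpeed API) of how the two schemes compose: tensor parallelism column-shards each layer's weight across "ranks", while pipeline parallelism splits the layers themselves into sequential stages. All names (`column_parallel`, `pipeline_forward`, `tp_size`) are illustrative assumptions.

```python
# Illustrative sketch only: simulates tensor parallelism (column-sharded
# matmul) composed with pipeline parallelism (sequential layer stages)
# using plain Python lists; it is not the DeepSpeed implementation.

def matmul(x, w):
    # x: [n], w: n x m -> [m]
    return [sum(x[i] * w[i][j] for i in range(len(x))) for j in range(len(w[0]))]

def column_parallel(x, w, tp_size):
    # Tensor parallelism: each rank holds a column shard of w and computes
    # a slice of the output; the slices are concatenated (an all-gather).
    shard = len(w[0]) // tp_size
    out = []
    for rank in range(tp_size):
        w_shard = [row[rank * shard:(rank + 1) * shard] for row in w]
        out.extend(matmul(x, w_shard))
    return out

def pipeline_forward(x, stages, tp_size):
    # Pipeline parallelism: each stage runs one layer, passing the
    # activation to the next stage; every stage is itself tensor-parallel.
    for w in stages:
        x = column_parallel(x, w, tp_size)
    return x

w0 = [[1, 2], [3, 4]]   # stage-0 weight (2x2)
w1 = [[1, 0], [0, 1]]   # stage-1 weight (identity)
x = [1.0, 1.0]
y = pipeline_forward(x, [w0, w1], tp_size=2)
# Sharded + staged execution matches the unsharded computation.
assert y == matmul(matmul(x, w0), w1)
```

The same composition is what a real engine would do with communication collectives in place of the list concatenation.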
-
I built my model with `--tp_size 2 --world_size 2`, put the two generated model files into the backend directory, and used the default `config.pbtxt`.
Then I ran the script/launch_triton_server.py --model_…
-
It seems there is a bug in the `use_mem_eff_path` feature when `ngroups` is greater than 1: the loss curve initially decreases but then plateaus around a constant value and fails to co…
-
# 🚀 Feature request
Splitting the discussion that started here: https://github.com/huggingface/transformers/pull/10301#issuecomment-782917393 to add the potential future feature of transformers and…
-
### The settings are as follows:
devices = 0&1&2&3;4&5&6&7
decoder_cpu_layer_count = 0
cpu_threads = 8
max_concurrent_queries = 6
return_output_tensors = true
;debug options
is_study_mod…
-
I've been using `atq.INT4_AWQ_CFG` and observing a performance drop when quantizing a Llama 70B model with tensor parallelism via `atq.quantize(model, quant_cfg, forward_loop=calibrate_loop)`.
Quan…
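For context on where a drop like this can come from, here is a hedged plain-Python sketch of per-channel symmetric INT4 weight quantization; it is not the atq/AWQ implementation, and `quantize_int4` / `dequantize` are hypothetical names, but it shows the rounding error that quantization introduces per channel.

```python
# Illustrative sketch only: symmetric per-channel (per-row) INT4
# quantization in plain Python, to show the rounding error that
# accumulates; not the atq/AWQ code path.

def quantize_int4(row):
    # Signed INT4 symmetric range [-7, 7], one scale per channel.
    scale = max(abs(v) for v in row) / 7.0 or 1.0
    q = [max(-7, min(7, round(v / scale))) for v in row]
    return q, scale

def dequantize(q, scale):
    return [v * scale for v in q]

w = [0.5, -1.0, 0.25, 0.9]
q, s = quantize_int4(w)
w_hat = dequantize(q, s)
# Reconstruction is close but not exact: the per-element error is
# bounded by half a quantization step (scale / 2).
assert all(abs(a - b) <= s / 2 + 1e-9 for a, b in zip(w, w_hat))
```

With only 15 levels per channel, outlier weights inflate the scale and coarsen every other weight in that channel, which is one common source of post-quantization accuracy loss.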
-
### 🐛 Describe the bug
I was trying to distribute a model using tensor parallelism, but I ran into a grad output type mismatch when I enabled compile. Note that I was not using loss parallel her…
-
I am following the code from the AWS documentation to host GPT-J-6B using DJL Serving:
[ https://github.com/aws/amazon-sagemaker-examples/blob/main/advanced_functionality/pytorch_deploy_l…
-
Would it be possible in this framework to combine the pipeline with tensor parallelism or ZeRO data parallelism?
-
I am unable to get the llama example to work with tensor parallelism.
I have 2x L4 machines (NVIDIA-SMI 525.105.17, Driver Version 525.105.17, CUDA Version 12.0).
When running the script
htt…