-
Currently, the `tools/merge_mp_partitions.py` script only supports merging tensor model parallel partitions and splitting the result into a given pipeline model parallel size, which is quite constraining. I suggest…
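For context, merging tensor-parallel shards of a weight back into one tensor is essentially a concatenation along that weight's partition dimension; a minimal sketch (the helper name and shapes are hypothetical, not taken from the script):

```python
import torch

def merge_tp_partitions(shards, partition_dim):
    """Concatenate tensor-parallel shards of one parameter, in TP-rank order.

    partition_dim is 0 for column-parallel weights and 1 for row-parallel
    weights under Megatron's convention; replicated parameters need no merge.
    """
    return torch.cat(shards, dim=partition_dim)

# Hypothetical usage: a column-parallel weight split across two TP ranks.
rank0 = torch.randn(2048, 4096)
rank1 = torch.randn(2048, 4096)
full = merge_tp_partitions([rank0, rank1], partition_dim=0)  # shape (4096, 4096)
```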
-
I am converting Mixtral-8x7B with tensor parallelism using the conversion script from the llama folder:
python convert_checkpoint.py --model_dir ./Mixtral-8x7B-v0.1 \
--out…
-
Hello, I have compared the training speed of tensor parallelism and pipeline parallelism in Megatron on a DGX A100 node.
I find that when the micro-batch size and gradient accumulation steps are bi…
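One piece of arithmetic behind that observation: for a GPipe/1F1B-style schedule with p pipeline stages and m micro-batches per step, the fraction of each device's time spent idle in the pipeline bubble is (p - 1) / (m + p - 1), so more gradient accumulation (more micro-batches) amortizes the bubble and favors pipeline parallelism. A quick illustration:

```python
def pipeline_bubble_fraction(num_stages: int, num_microbatches: int) -> float:
    """Idle fraction of a GPipe/1F1B pipeline: (p - 1) / (m + p - 1)."""
    p, m = num_stages, num_microbatches
    return (p - 1) / (m + p - 1)

# With 8 pipeline stages:
for m in (1, 4, 16, 64):
    print(m, round(pipeline_bubble_fraction(8, m), 3))
# 1   0.875   (pipeline mostly idle)
# 4   0.636
# 16  0.304
# 64  0.099   (bubble nearly amortized away)
```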
-
I'm trying to figure out why my script doesn't work without the "--force_multi" param in ds_launch_str
https://github.com/microsoft/DeepSpeed-MII/blob/0fe4eb86b93e8210736f3e8c671bc886af64fd67/mii/server.py…
-
When running the notebook for inference using [Llama3](https://github.com/aws-neuron/aws-neuron-samples/blob/master/torch-neuronx/transformers-neuronx/inference/meta-llama-2-13b-sampling.ipynb)
```…
-
### Your current environment
```text
PyTorch version: 2.3.0+cu121
Is debug build: False
CUDA used to build PyTorch: 12.1
ROCM used to build PyTorch: N/A
OS: Ubuntu 22.04.4 LTS (x86_64)
GCC …
-
Are tensor parallelism and pipeline parallelism currently supported?
-
Hi,
I was able to run the _TheBloke/Mixtral-8x7B-Instruct-v0.1-GPTQ_ model on 2 A10 GPUs on AWS SageMaker, using the _ml.g5.12xlarge_ instance type.
Command to run the code:
`python3 -m vllm.ent…
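For anyone reproducing this, the same configuration can also be expressed through vLLM's offline Python API; a minimal sketch (the prompt and token budget are arbitrary):

```python
from vllm import LLM, SamplingParams

# GPTQ Mixtral sharded across the 2 A10 GPUs via tensor parallelism.
llm = LLM(
    model="TheBloke/Mixtral-8x7B-Instruct-v0.1-GPTQ",
    quantization="gptq",
    tensor_parallel_size=2,
)
outputs = llm.generate(
    ["Explain tensor parallelism in one sentence."],
    SamplingParams(max_tokens=64),
)
print(outputs[0].outputs[0].text)
```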
-
To train a 7B model with Megatron-DeepSpeed with
tensor_parallelism=2
pipeline_parallelism=8
how many GPUs do I need?
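The required number of GPUs is the product of the parallelism degrees, so at least 2 × 8 = 16 with a data-parallel degree of 1; a quick check:

```python
# World size = tensor_parallel * pipeline_parallel * data_parallel.
tensor_parallel = 2
pipeline_parallel = 8
data_parallel = 1                  # minimum; increase it to scale throughput
print(tensor_parallel * pipeline_parallel * data_parallel)   # 16 GPUs minimum
```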
-
**Describe the bug**
In Hybrid Engine, `apply_tensor_parallelism()` is not called when the model inference container requires tp > 1 but `self.mpu` is None. For example, for a large model in ZeRO-3, …