I am trying to run examples/pretrain_vision_classify.sh. Are tensor parallelism and pipeline parallelism supported for vision models? In other words, can I set tensor-model-parallel-size or pipeline-model-parallel-size to a value greater than 1?