Currently, `tools/merge_mp_partitions.py` only supports merging tensor model parallel partitions and then splitting the result into a given pipeline model parallel size, which is quite limiting. I suggest extending the script so that it merges both tensor and pipeline model parallel partitions, and providing a separate script for splitting a checkpoint into partitions.
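To make the request concrete, here is a minimal sketch of the two operations I have in mind, written with plain nested lists instead of real checkpoint tensors. The helpers `split_partitions`, `merge_partitions`, and `split_layers` are hypothetical names for illustration, not functions that exist in `tools/merge_mp_partitions.py`, and the sketch assumes the split dimension and the layer count divide evenly.

```python
# Illustrative sketch only: nested lists stand in for weight tensors.
# All three helpers are hypothetical and not part of the existing script.

def split_partitions(weight, num_partitions, dim):
    """Split a 2D weight (list of rows) into equal shards along dim 0 or 1."""
    if dim not in (0, 1):
        raise ValueError("dim must be 0 or 1")
    size = len(weight) if dim == 0 else len(weight[0])
    if size % num_partitions != 0:
        raise ValueError("dimension is not divisible by the partition count")
    step = size // num_partitions
    if dim == 0:
        # Each shard keeps a contiguous block of rows.
        return [weight[i * step:(i + 1) * step] for i in range(num_partitions)]
    # Each shard keeps a contiguous block of columns from every row.
    return [[row[i * step:(i + 1) * step] for row in weight]
            for i in range(num_partitions)]


def merge_partitions(shards, dim):
    """Inverse of split_partitions: concatenate shards along dim 0 or 1."""
    if dim not in (0, 1):
        raise ValueError("dim must be 0 or 1")
    if dim == 0:
        return [row for shard in shards for row in shard]
    num_rows = len(shards[0])
    return [sum((shard[r] for shard in shards), []) for r in range(num_rows)]


def split_layers(layers, num_stages):
    """Assign consecutive layers to pipeline stages of equal size."""
    if len(layers) % num_stages != 0:
        raise ValueError("layer count is not divisible by the stage count")
    step = len(layers) // num_stages
    return [layers[i * step:(i + 1) * step] for i in range(num_stages)]


if __name__ == "__main__":
    weight = [[1, 2, 3, 4], [5, 6, 7, 8]]
    shards = split_partitions(weight, num_partitions=2, dim=1)
    # Merging the shards along the same dimension restores the original weight.
    print(merge_partitions(shards, dim=1) == weight)
    print(split_layers(["layer0", "layer1", "layer2", "layer3"], num_stages=2))
```

With merge and split as separate steps like this, a checkpoint could first be merged into a single unpartitioned copy and then split into any tensor and pipeline layout, instead of being tied to one fixed conversion path.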