Closed kojimano closed 1 year ago
#GPUs | Size | DP | MP | PP | MBS | GBS | SL | Scattered | Interleaved | AC/DAC | Max Mem (allocated) | Max Mem (reserved) | TFLOPs | Sec/it | Notes |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
32 | 10.1B | 1 | 4 | 8 | 1 | 88 | 2048 | Yes | No | Yes | 7122 MiB | 7370 MiB | 39.4 | 12.1 | 4/28 |
32 | 10.1B | 1 | 4 | 8 | 2 | 88 | 2048 | Yes | No | Yes | 7251 MiB | 7620 MiB | 40.2 | 11.8 | 4/28 |
32 | 10.1B | 1 | 4 | 8 | 4 | 88 | 2048 | Yes | No | Yes | 7731 MiB | 9296 MiB | 37.2 | 12.8 | 4/28 |
32 | 10.1B | 1 | 4 | 8 | 8 | 88 | 2048 | Yes | No | Yes | 7698 MiB | 10056 MiB | 37.3 | 12.8 | 4/28 |
32 | 10.1B | 1 | 4 | 8 | 2 | 80 | 2048 | Yes | 3 | Yes | 7347 MiB | 7938 MiB | 41.3 | 10.5 | 4/28 |
32 | 10.1B | 1 | 4 | 8 | 1 | 96 | 2048 | Yes | 2 | Yes | 8532 MiB | 9020 MiB | 41.5 | 12.5 | 4/28 |
32 | 10.1B | 1 | 4 | 8 | 1 | 96 | 2048 | No | 2 | Yes | 8532 MiB | 9020 MiB | 35.8 | 14.5 | 4/28 |
32 | 10.1B | 1 | 4 | 8 | 2 | 96 | 2048 | Yes | 2 | Yes | 7107 MiB | 7782 MiB | 38.7 | 13.4 | 4/28 |
32 | 13 B | 1 | 4 | 8 | 2 | 96 | 2048 | Yes | 2 | Yes | - MiB | - MiB | - | - | 4/28 |
Model benchmarking results
Overview
Model hyperparameters
Notations
Preliminary Experiments
Memory usages seems to increase after logging?
Experiments-1
Deepspeed (Reduce PP bubble / disable activation checkpoints)
Activation Partitioning and Activation Checkpointing Chunks
Notes