Open BichengYing opened 4 years ago
It is known that win_ops will duplicate the parameters and consume more GPU memories. However, memory usage is not clear during the runtime. We need tools to have an accurate number.
Based on observation so far, we didn't observe much extra GPU memory usage in neighbor ops, win_ops versus allreduce
It is known that win_ops will duplicate the parameters and consume more GPU memories. However, memory usage is not clear during the runtime. We need tools to have an accurate number.