mlcommons / algorithmic-efficiency

MLCommons Algorithmic Efficiency is a benchmark and competition measuring neural network training speedups due to algorithmic improvements in both training algorithms and models.
https://mlcommons.org/en/groups/research-algorithms/
Apache License 2.0
321 stars, 62 forks

Imagenet ResNet variants are OOMing #644

Closed: priyakasimbeg closed this issue 6 months ago

priyakasimbeg commented 7 months ago

The imagenet_resnet_silu and imagenet_resnet_gelu workloads are OOMing:

resnet_silu logs, resnet_gelu logs

priyakasimbeg commented 6 months ago

Fixed in https://github.com/mlcommons/algorithmic-efficiency/pull/656