The goal of this pull request is to (1) preserve the present CI as is, and (2) allow users to export an MHLO for on-device inference and fine-tuning, which requires batch_size == 1.
I confirmed that I can execute the following commands to generate the MHLO file and build it into Metal GPU kernels.
The goal of this pull request is to (1) preserve the present CI as is, and (2) allow users to export an MHLO for on-device inference and fine-tuning, which requires
batch_size == 1
.I confirmed that I can execute the following commands to generate the MHLO file and build it into Metal GPU kernels.
The command
iree-compile
was built from the feature branch https://github.com/antiagainst/iree/tree/apple-metal-hal.