Closed zhengxiawu closed 5 years ago
umm. I am the author of ProxylessNAS. Actually, we have experimented on both RL and Gradients.
@Lyken17 Yes, I will correct it.
By naively considering latency as (CE_loss + lambda * latency), the Gradients-based models are slightly worse than RL-based models. So our main results are from RL-based approaches.
Thanks