carpedm20 / ENAS-pytorch

PyTorch implementation of "Efficient Neural Architecture Search via Parameters Sharing"
Apache License 2.0
2.7k stars 494 forks source link

REINFORCE #2

Open carpedm20 opened 6 years ago

carpedm20 commented 6 years ago

It is clear that controller falls into a local optimal while it can't find better actions from REINFORCE. I think unknown c of c/valid ppl, moving average baseline and temperature of logits are what needed to be fixed. See more details (especially TODOs) in 497c2e717dc0087fea52d4f196d30543e4fb7512.