Closed EricDinging closed 11 months ago
May I know the results for cat femnist_logging |grep "FL Testing"
while specify - gradient_policy: fed-yogi
?
@AmberLJC Here is the result [
](url)
- yogi_eta: 0.01
- yogi_tau: 0.001
- yogi_beta: 0.01
- yogi_beta2: 0.99
Thank you!
@fanlai0990
Why are these changes needed?
There is a bug when executor pulls the model from the aggregator. In original implementation, the model adapter will execute the optimizer step at the executor, which should theoretically be executed only at the aggregator end, leading to poor performance for fed-yogi. What I did:
gradient_policy
(optimizer) naming in several config files, fromyogi
tofed-yogi
. If useyogi
, the real optimizer would befed-avg
as if statement forfed-yogi
in optimizer is not entered.fed-yogi
according to the fed-yogi paper. Original setup might cause the model to drift a little bit before starting to convergeRelated issue number
243
Checks
FEMNIST fed-yogi optimizer run result