Open kdmalc opened 1 year ago
What is optimal global aggregation and model push down strat? What is optimal number of iters per update? What is optimal eta for each training loop? This presumably affects the above
What is optimal global aggregation and model push down strat? What is optimal number of iters per update? What is optimal eta for each training loop? This presumably affects the above