Tensor-Reloaded / Convergence

Artificial Neural Network training convergence boosting by smart data ordering
Academic Free License v3.0

Experiment Relation to Lottery Ticket #6

Open simi2525 opened 4 years ago

simi2525 commented 4 years ago

Attempt the classic Lottery Ticket method: train a large model, prune it, and then retrain the subnet forward from a previous state.

Try to train the subnet from a random initialization but with an "ideal" training order (found by brute force).

The current thinking around tickets is that the initialization of a given subnet is the defining feature of the ticket; the best-known way of training the subnet is currently IMP with the rewinding technique.

Perhaps an optimal training order would minimize the effect of the subnet's initialization.

Let's see whether the difference in accuracy between (random subnet initialization + perfect order) and IMP with rewinding is reduced.
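As a concrete illustration of the brute-force part of the proposal, here is a minimal sketch (not the repo's actual code) of searching over every permutation of a tiny training set and keeping the ordering that yields the best post-training accuracy. The perceptron, the toy data, and all names are hypothetical stand-ins for the real model and dataset:

```python
import itertools
import numpy as np

# Hypothetical toy data: 4 linearly separable points, labeled by sign(x1 - x2).
X = np.array([[2.0, 0.0], [0.0, 2.0], [3.0, 1.0], [1.0, 3.0]])
y = np.array([1, -1, 1, -1])

def train_in_order(order, lr=0.5):
    """One online perceptron pass over the examples in the given order."""
    w = np.zeros(2)  # the initialization is fixed across all orderings
    for i in order:
        pred = np.sign(w @ X[i]) or 1.0  # break the sign(0) tie toward +1
        if pred != y[i]:
            w += lr * y[i] * X[i]        # perceptron update on a mistake
    return w

def accuracy(w):
    preds = np.where(X @ w >= 0, 1, -1)
    return (preds == y).mean()

# Brute force: evaluate every permutation of the training order and keep
# the one with the best final accuracy -- the "ideal" order for this run.
best_order, best_acc = None, -1.0
for order in itertools.permutations(range(len(X))):
    acc = accuracy(train_in_order(order))
    if acc > best_acc:
        best_order, best_acc = order, acc

print(best_order, best_acc)
```

For n examples this search costs n! training runs, which is exactly why the issue calls it brute force; any real experiment would need far fewer examples or a heuristic search over orderings.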

simi2525 commented 4 years ago

In the 2019 Frankle et al. paper, the researchers presented a new technique called Iterative Magnitude Pruning (IMP) with rewinding.

Instead of looking at the neurons' weights at initialization (rewinding to iteration zero), it looks at their weights after several training iterations (rewinding to iteration k).

The rationale behind the technique is that in many architectures, some neurons already have a “winning” weight at initialization, while others only reach a “winning” weight after some training.

The rationale for using a perfect ordering is the same: it should increase a given ticket's chance of reaching a "winning" state after some training.
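The IMP-with-rewinding loop described above can be sketched on a toy model. The following is a minimal NumPy illustration, not the paper's implementation: a logistic-regression "network" is trained, its smallest-magnitude weights are pruned, and the surviving weights are rewound to their iteration-k values before retraining. The data, the pruning fraction, and the step counts are all made-up assumptions:

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy data: 10 features, but only the first 3 carry signal.
X = rng.normal(size=(200, 10))
true_w = np.zeros(10)
true_w[:3] = [3.0, -2.0, 1.5]
y = (X @ true_w > 0).astype(float)

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train(w, mask, steps, lr=0.1):
    """Gradient descent; pruned weights (mask == 0) stay frozen at zero."""
    w = w * mask
    for _ in range(steps):
        grad = X.T @ (sigmoid(X @ w) - y) / len(X)
        w -= lr * grad * mask
    return w

w_init = rng.normal(scale=0.1, size=10)
mask = np.ones(10)

# Warm-up: train for k iterations and save the weights to rewind to.
k = 20
w_k = train(w_init.copy(), mask, steps=k)

# IMP with rewinding: repeat (train, prune smallest-magnitude survivors,
# rewind the survivors to their iteration-k values).
prune_frac = 0.3
for _ in range(4):
    w_final = train(w_k.copy(), mask, steps=300)
    alive = np.flatnonzero(mask)
    n_prune = max(1, int(prune_frac * len(alive)))
    prune_idx = alive[np.argsort(np.abs(w_final[alive]))[:n_prune]]
    mask[prune_idx] = 0.0   # prune the smallest-magnitude surviving weights
    w_k = w_k * mask        # rewind survivors to iteration k, zero the rest

# Retrain the final ticket from the rewound weights and measure accuracy.
w_ticket = train(w_k.copy(), mask, steps=300)
acc = ((sigmoid(X @ w_ticket) > 0.5) == y).mean()
print(int(mask.sum()), acc)
```

The key difference from the original lottery-ticket procedure is the `w_k = w_k * mask` rewind step: survivors restart from their iteration-k values rather than from `w_init`, matching the rationale that some weights only become "winning" after some training.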