Fassty closed 1 year ago
After a discussion with a classmate who was copying the target weights after N epochs instead of after N gradient updates, I deemed these changes useful :)
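
To illustrate the distinction, here is a minimal sketch, not the code from this PR. It assumes a toy training loop with plain dictionaries standing in for the online and target networks. The names `train`, `sync_every`, and `TARGET_UPDATE_EVERY` are hypothetical. The point is only that the sync condition is checked against a running count of gradient updates, which keeps growing across epochs, rather than against the epoch index.

```python
import copy

TARGET_UPDATE_EVERY = 4  # hypothetical N, counted in gradient updates


def train(num_epochs, updates_per_epoch, sync_every=TARGET_UPDATE_EVERY):
    """Toy loop: returns the update counts at which the target was synced."""
    online_weights = {"w": 0.0}                     # stand-in for the online network
    target_weights = copy.deepcopy(online_weights)  # stand-in for the target network
    sync_steps = []
    gradient_updates = 0  # never reset between epochs

    for _epoch in range(num_epochs):
        for _batch in range(updates_per_epoch):
            online_weights["w"] += 0.1  # placeholder for one optimizer step
            gradient_updates += 1

            # Sync on the gradient-update counter, not on the epoch counter.
            if gradient_updates % sync_every == 0:
                target_weights = copy.deepcopy(online_weights)
                sync_steps.append(gradient_updates)

    return sync_steps, target_weights


sync_steps, target = train(num_epochs=2, updates_per_epoch=6)
print(sync_steps)  # [4, 8, 12]
```

With 2 epochs of 6 updates each, the target is copied three times (at updates 4, 8, and 12). Syncing every 4 epochs instead would never copy it in this run.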