Closed JohnAllen closed 1 year ago
Can someone help me understand where the training actually happens? Where do the rewards feed back back into a network or something that makes a better action in the future more likely?
Can someone help me understand where the training actually happens? Where do the rewards feed back back into a network or something that makes a better action in the future more likely?