Closed by wailker3 5 years ago
The author uses only one CNN to compute both the evaluated (current) Q value and the target Q value. Shouldn't there be a separate CNN to compute the target Q value?
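For reference, here is a minimal sketch of the two-network setup the question has in mind. It is not the author's code: a tiny linear model stands in for the CNN, and every name in it (`LinearQ`, `td_target`, `train_step`, `sync_target`) is hypothetical. The point it illustrates is that the target Q value comes from a separate, frozen copy of the network, which is only refreshed from the trained network every so often.

```python
import copy
import random


class LinearQ:
    """Hypothetical stand-in for the CNN: Q(s, a) = w[a] . s"""

    def __init__(self, n_features, n_actions, seed=0):
        rng = random.Random(seed)
        self.w = [[rng.uniform(-0.1, 0.1) for _ in range(n_features)]
                  for _ in range(n_actions)]

    def q_values(self, state):
        # One Q value per action.
        return [sum(wi * si for wi, si in zip(row, state)) for row in self.w]


def td_target(target_net, reward, next_state, done, gamma=0.99):
    # The target uses the frozen target network, not the one being trained.
    if done:
        return reward
    return reward + gamma * max(target_net.q_values(next_state))


def train_step(online_net, target_net, transition, lr=0.1, gamma=0.99):
    state, action, reward, next_state, done = transition
    y = td_target(target_net, reward, next_state, done, gamma)
    td_error = y - online_net.q_values(state)[action]
    # Gradient step on 0.5 * td_error**2; only the online weights change.
    online_net.w[action] = [wi + lr * td_error * si
                            for wi, si in zip(online_net.w[action], state)]
    return td_error


def sync_target(online_net, target_net):
    # Called every N steps (N is a hyperparameter) to refresh the target copy.
    target_net.w = copy.deepcopy(online_net.w)


online_net = LinearQ(n_features=3, n_actions=2)
target_net = copy.deepcopy(online_net)  # starts as an identical copy
```

With a single network, the `target_net` argument would simply be `online_net`, so the target moves with every update. Keeping a separate copy holds the target fixed between syncs.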