algorithmsbooks / decisionmaking

Algorithms for Decision Making textbook

Typo in Chapter 12, pg. 257: "similary" #85

Closed (lkruse closed this issue 2 years ago)

lkruse commented 2 years ago

• We can use a pessimistic lower bound of the trust region policy optimization objective to obtain a clamped surrogate objective that performs similary without the need for line search.

mykelk commented 2 years ago

Thanks!