Open tessavdheiden opened 2 years ago
Hi Matthew!
This repo is just great: it works, and it's transparent and modular!
I only found two differences between Ziebart's thesis and your implementation. Can you let me know if you were aware of them?
So here is Eq 9.2:
Here is your code:
And here is Eq 9.1:
Which uses $V^{\text{soft}}$:
And here is your code:
You include a discount factor in Eq 9.2, and in 9.1 you convert a subtraction ($Q^{\text{soft}}-V^{\text{soft}}$) into a fraction ($\frac{Q^{\text{soft}}}{V^{\text{soft}}}$), correct?
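In case it helps pin down the second point, here is a minimal sketch of the identity I suspect is involved. It assumes that $V^{\text{soft}}$ is the log-sum-exp of $Q^{\text{soft}}$ over actions and that the code may be storing exponentiated values, in which case the subtraction inside the exponent shows up as a ratio. The Q-values are made-up numbers, not taken from the repo.

```python
import numpy as np

# Hypothetical soft Q-values for a single state with three actions
# (illustrative numbers only, not from the repository).
q_soft = np.array([1.0, 2.0, 0.5])

# Assumption: V_soft is the log-sum-exp ("softmax") of Q_soft over actions.
v_soft = np.log(np.sum(np.exp(q_soft)))

# Log-space form: policy = exp(Q_soft - V_soft).
policy_log_space = np.exp(q_soft - v_soft)

# Exponentiated form: policy = exp(Q_soft) / exp(V_soft).
policy_ratio = np.exp(q_soft) / np.exp(v_soft)

# Both forms give the same distribution over actions.
print(np.allclose(policy_log_space, policy_ratio))
```

Under these assumptions the two forms agree, so a ratio would only be a real difference if it is taken between the raw (non-exponentiated) $Q^{\text{soft}}$ and $V^{\text{soft}}$.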