tspooner / rl_markets

Market Making via Reinforcement Learning
https://arxiv.org/abs/1804.04216v1
BSD 3-Clause "New" or "Revised" License
311 stars 105 forks source link

theta_b of Class DoubleAgent unused? #9

Closed stephenkgu closed 4 years ago

stephenkgu commented 4 years ago

hi, first thanks for this great work.

getQ(s, a) and getQb(s, a) both use theta data, leave theta_b unused, is this correct?

I did not find any correspond DoubleAgent in project rsrl to reference too. @tspooner

tspooner commented 4 years ago

Hi Stephen,

Glad you got something from it! You are indeed right, there appears to be an error in that code. Interestingly, this must have been introduced when I re-factored the code for a public version as it doesn't appear in my original source. The DoubleAgent should use theta_b in the getQb function.

I'll push a change for this immediately, so thanks for the spot!

Indeed, I haven't implemented this in rsrl yet, but will get around to it ASAP.

Cheers, Tom

tspooner commented 4 years ago

See commit: 2de4cbb0936dd01cf5611e457f7bc2453403ef7d