issues
search
mjuchli
/
ctc-executioner
Master Thesis: Limit order placement with Reinforcement Learning
176
stars
83
forks
source link
[Admin] Meeting on 15.01.18
#14
Closed
mjuchli
closed
6 years ago
mjuchli
commented
6 years ago
Progress
Framework
Orders (incl. Types)
Order book
Match Engine
Action State
Action
Actions Space
Reward functions tested with Q-Learning
Cumulative Reward
Profit on backtest
Next steps
Increase Action State, e.g. more features
Use policy gradient, e.g. recurrent neural nets
Imitation Learning
Questions
[x] Feedback on reward function
[x] Feedback on evaluation techniques
[x] Currently learns on random timestamp in order book. Would sequential learning and testing make sense?
mjuchli
commented
6 years ago
Remarks:
Add features from the future in order to artificially make profit with “cheating” (e.g. snooping).
Is random sampling worse than sequential sampling?
Do not interfere in reward function and leave as is
Progress
Next steps
Questions