BrunoScience / BrunoScience.github.io

The best that most of us can hope to achieve in science is simply to misunderstand at a deeper level.
https://BrunoScience.github.io
Apache License 2.0
0 stars 1 forks source link

Distributional Reinforcement Learning or Distributional Dynamic Programming #8

Open MarkBruns opened 1 year ago

MarkBruns commented 1 year ago

Distributional reinforcement learning goes beyond the common approach to reinforcement learning and expected values, in that it focuses on the total reward or value of a return obtained as a consequence of an agent's choices—specifically, how this return behaves in a probabilistic perspective.

Table of Contents Preface 1 Introduction 2 The Distribution of Returns 3 Learning the Return Distribution 4 Operators and Metrics 5 Distributional Dynamic Programming 6 Incremental Algorithms 7 Control 8 Statistical Functionals 9 Linear Function Approximation 10 Deep Reinforcement Learning 11 Two Applications and a Conclusion Notation Bibliography

Of course, valuation of risky alternatives must reflect uncertainty ... and uncertainty about uncertainty. So we are not only interested in variation, but in jumps or rough volatility of variation ... in other words, consideration of alternatives amounts to stochastic calculus with the best approximations of rough volatility or partially rough volatility for the valuations of various options presented.