NM512 / dreamerv3-torch

Implementation of Dreamer v3 in pytorch.
MIT License
425 stars 96 forks source link

Meaning of keyword abbreviations #56

Open zichunxx opened 5 months ago

zichunxx commented 5 months ago

Hi!

I'm new to model-based reinforcement learning and thanks for your contribution to Pytorch users.

I tried to read your code to understand the logistics of dreamerv3 but found some details are not mentioned in the original paper, especially the abbreviations of some keywords.

For example,

https://github.com/NM512/dreamerv3-torch/blob/4e50f302cdfaca1c8104f203376844f82c635a4e/networks.py#L174-L179

the prev_state is a dict including three keys, i.e., logit, stoch, and deter.

What do they mean and where can I find a more specific explanation?

Thanks for your time!

NM512 commented 1 month ago

Thanks for asking!

Each of them means the following

If you have any questions, please ask more!