DeNA / HandyRL

HandyRL is a handy and simple framework based on Python and PyTorch for distributed reinforcement learning that is applicable to your own environments.
MIT License
282 stars 42 forks source link

feature: accept any structure of results #327

Closed YuriCat closed 2 years ago

YuriCat commented 2 years ago

This commit enables us to accept outputs like {'estimate_input': {'a': Tensor, 'b': Tensor}}

YuriCat commented 2 years ago

This PR is not enough.