dotnet / TorchSharp

A .NET library that provides access to the library that powers PyTorch.
MIT License
1.29k stars 167 forks source link

is there ppo example by torchsharp? #981

Closed diybl closed 8 months ago

diybl commented 1 year ago

is there ppo example by torchsharp?

GeorgeS2019 commented 1 year ago

@ChengYen-Tang WIP: PPO

If possible @diybl please support and coordinate with @ChengYen-Tang

GeorgeS2019 commented 1 year ago

@diybl @ChengYen-Tang Deel RL using TorchSharp for Deep Learning

FYI: @NiklasGustafsson

GeorgeS2019 commented 1 year ago

FYI: @NiklasGustafsson

WIP: TorchSharp for Deep QLearning

https://github.com/xin-pu/DeepSharp/discussions/10

Items

WIP: Deep RL using TorchSharp

mindmap
  root((Reinforcement<br/>Learning))
    Definitions
      Interactions
          Environment
          Agent
      Elements
         State
         Action
         Strategy<br/>策略
             Deterministic Policy<br/>确定性策略
             Stochastic Policy<br/>随机性策略
         State transfer probability<br/>状态转移概率
         Rewards<br/>即时奖励
      Others
        Episodes
        Trial
        Continuing Tasks
    Policy
      Policy based learning
      Value based learning
        Monte Carlo learning
             Temporal Difference Learning
                 SARSA<br/>State Action Reward State Action
                 QLearning
        Dynamic programming learning
            Policy iteration algorithm
                Policy Evaluation
                Policy Improvement
            Value iteration algorithm
    Markov Decision Process
      Markov Decision Process<br/>马尔科夫决策过程
           Trajectory<br/>轨迹
      Markov Process<br/>马尔科夫过程
   Objective Functions
GeorgeS2019 commented 10 months ago

@diybl @ChengYen-Tang @NiklasGustafsson <= dotnet team @xin-pu

This is done => Now with WinForm UI and Winform Chat to monitor training progress RL_Matrix => nuget Gym.NET using TorchSharp

@asieradzk RL Matrix

https://github.com/asieradzk/RL_Matrix/issues/1#issuecomment-1691274872

Next step is to bring it to Godot

asieradzk commented 10 months ago

Yup. I'm hot on brining RL Matrix to Godot, particularly to flex on Unity's ML Agents. Ideally I'd like to have at least: DDPG, Rainbow, Deep-MCTS (AlphaZero) on top of DQN and PPO

Please give me architecture advice on how you'd like environment/agent creation implemented, I am basing my current version on how matlab does it to allow easy swapping of agents/environments.

asieradzk commented 10 months ago

@asieradzk

Here, we only work on MIT FOSS license

Thank you, great to hear. I will see about license change once I am done with Godot RL agents. Hopefully I find some time in the next weeks in-between working on my PhD.

GeorgeS2019 commented 9 months ago

@NiklasGustafsson

cc this issue

GeorgeS2019 commented 8 months ago

Related issue

GeorgeS2019 commented 8 months ago

@NiklasGustafsson

There is now PPO example using TorchSharp

NiklasGustafsson commented 8 months ago

@diybl -- can this be closed? I'm eager to close out old issues to get a proper sense of the size of our backlog.