-
I want to ask one more thing about the estimation of the discounted reward. The discounted-reward variable always starts at zero. However, if the episode has not ended, should it instead be the value estimate …
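For illustration, a minimal sketch of that idea: when the rollout is cut off before the episode terminates, seed the backward pass with the critic's value estimate instead of zero. The function name, the `last_value` argument, and the `gamma` default are my own, not from the original code:
```
def discounted_returns(rewards, dones, last_value, gamma=0.99):
    # rewards, dones: per-step lists from one rollout.
    # last_value: critic's estimate V(s_T) for the state after the final
    # step (hypothetical argument, not from the original post).
    returns = []
    running = last_value  # bootstrap instead of starting from zero
    for reward, done in zip(reversed(rewards), reversed(dones)):
        # a true terminal step still cuts the discounted tail to zero
        running = reward + gamma * running * (1.0 - float(done))
        returns.append(running)
    returns.reverse()
    return returns
```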
-
Hello, thank you for making this repo.
I think that while calculating the returns you should take `done` into consideration, as in:
```
def calculate_returns(self, rewards, dones, normalize=True):
…
```
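One possible done-aware completion of that method, written as a free function for brevity and assuming a PyTorch repo; the discount factor and the normalization epsilon are assumptions, not the poster's actual code:
```
import torch

def calculate_returns(rewards, dones, gamma=0.99, normalize=True):
    returns = []
    running = 0.0
    for reward, done in zip(reversed(rewards), reversed(dones)):
        # zero the running return at episode boundaries so returns from
        # one episode do not bleed into the preceding one
        running = reward + gamma * running * (1.0 - float(done))
        returns.append(running)
    returns = torch.tensor(list(reversed(returns)))
    if normalize:
        returns = (returns - returns.mean()) / (returns.std() + 1e-8)
    return returns
```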
-
`truncated_generalized_advantage_estimation` should have `stop_target_gradients` default to `True`
https://github.com/deepmind/rlax/blob/383f93bc8b33c3d1bc28f15e1e07fc5104c790ea/rlax/_src/mul…
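For context, a toy call with the flag passed explicitly, assuming the current rlax signature where `stop_target_gradients` defaults to `False`:
```
import jax.numpy as jnp
import rlax

# Toy rollout of T = 3 steps; values has length T + 1.
r_t = jnp.array([1.0, 0.5, 2.0])
discount_t = jnp.array([0.99, 0.99, 0.0])  # 0.0 marks episode end
values = jnp.array([0.8, 0.9, 1.1, 0.0])

# The flag must currently be passed explicitly to stop gradients
# flowing into the value targets:
adv = rlax.truncated_generalized_advantage_estimation(
    r_t, discount_t, lambda_=0.95, values=values,
    stop_target_gradients=True)
```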
-
https://lilianweng.github.io/lil-log/2018/04/08/policy-gradient-algorithms.html
https://talkingaboutme.tistory.com/entry/RL-Policy-Gradient-Algorithms
https://www.telesens.co/2019/04/21/understa…
-
Example: Box-Cox transformation with unknown parameter.
Reference (that I just found again):
A. Scallan, R. Gilchrist, M. Green, "Fitting parametric link functions in generalised linear models"
http://…
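As a quick illustration of the unknown-parameter idea in scipy (transforming the data directly, not as a GLM link): `scipy.stats.boxcox` estimates the transformation parameter by maximum likelihood when `lmbda` is not supplied. The data here are synthetic:
```
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
y = rng.lognormal(mean=0.0, sigma=0.5, size=200)  # positive, skewed data

# with lmbda omitted, boxcox also returns the MLE of the parameter
y_transformed, lmbda_hat = stats.boxcox(y)
print(f"estimated Box-Cox lambda: {lmbda_hat:.3f}")
```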
-
Another ancient theme with nothing public in statsmodels.
A brief GitHub search of Python repos for extreme value statistics:
https://github.com/wafo-project/pywafo, a package by Per A. Brodtkorb, but GPL-licensed
http…
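For reference, the non-GPL baseline that already exists outside statsmodels: scipy can fit a generalized extreme value distribution by MLE (synthetic data; note scipy's shape sign convention `c = -xi`):
```
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
# synthetic block maxima; scipy's c is the negative of the usual GEV xi
block_maxima = stats.genextreme.rvs(c=-0.1, loc=10.0, scale=2.0,
                                    size=500, random_state=rng)

shape, loc, scale = stats.genextreme.fit(block_maxima)
print(f"shape={shape:.3f}, loc={loc:.3f}, scale={scale:.3f}")
```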
-
Regression (_e.g._ linear regression, logistic regression, Poisson regression, etc.) is very important in machine learning. Many problems can be formulated as (regularized) regression.
…
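A minimal example of the regularized case, using ridge regression in scikit-learn on synthetic data (the penalty weight is arbitrary, chosen only for the example):
```
import numpy as np
from sklearn.linear_model import Ridge

rng = np.random.default_rng(0)
X = rng.normal(size=(100, 3))
y = X @ np.array([1.5, -2.0, 0.5]) + rng.normal(scale=0.1, size=100)

# L2-regularized least squares; alpha controls the penalty strength
model = Ridge(alpha=1.0).fit(X, y)
print(model.coef_)
```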
-
(this is triggered by some readings on dispersion estimation as a follow-up to Tweedie #2858 #2872)
The question is how we estimate dispersion parameters or data-varying (exog, not mean) variance func…
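As a concrete starting point, the usual moment-based estimate in a GLM is Pearson chi-square over residual degrees of freedom; a sketch on synthetic Poisson data (this is the standard quasi-likelihood estimator, not a proposal for new API):
```
import numpy as np
import statsmodels.api as sm

rng = np.random.default_rng(0)
X = sm.add_constant(rng.normal(size=(200, 2)))
mu = np.exp(X @ np.array([0.5, 0.3, -0.2]))
y = rng.poisson(mu)

res = sm.GLM(y, X, family=sm.families.Poisson()).fit()
# Pearson chi^2 / df_resid: close to 1 for equidispersed Poisson data
dispersion = res.pearson_chi2 / res.df_resid
print(f"Pearson dispersion estimate: {dispersion:.3f}")
```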
-
I have been trying to implement a PPO agent that solves LunarLander-v2, following the official example in the GitHub repo:
https://github.com/tensorflow/agents/blob/master/tf_agents/agents/ppo/examples/v2…
-
A bit similar to the idea of automatic forecasting: find the best-fitting distributional assumption in MLE models.
Main advantages compared to users doing it themselves (a sketch of the basic selection loop follows this list):
- predefined sequence, autom…
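A sketch of the basic selection loop (the assumed shape of the idea, not statsmodels code): fit each candidate distribution by MLE and rank by AIC:
```
import numpy as np
from scipy import stats

rng = np.random.default_rng(0)
data = rng.gamma(shape=2.0, scale=1.5, size=500)

candidates = [stats.norm, stats.gamma, stats.lognorm, stats.expon]
results = []
for dist in candidates:
    params = dist.fit(data)          # MLE for each candidate
    loglike = np.sum(dist.logpdf(data, *params))
    aic = 2 * len(params) - 2 * loglike
    results.append((aic, dist.name))

for aic, name in sorted(results):    # lowest AIC first
    print(f"{name}: AIC = {aic:.1f}")
```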