-
Hi! I'm learning DDP method recently and also upvoted your brilliant implementation. It seems like you are using the MPC version of ilqr? I change it into normal version but it does not converged any …
-
Raw File: https://drive.google.com/drive/folders/1fCWSFAtrvlxlwXISEFKsYOpSXyQFrYCB?usp=sharing
Transcript: https://www.rev.com/transcript-editor/shared/zgZAdJPdVs8aCR8I5t9q4stRlSW5LOfj57qlO4VJ9Kzz7V5…
-
Learning AngularJS had been a struggle for me to learn at first--new vocabulary, $scope, digest cycle, controllers, the whole nine yards. Having been a developer for only half a year didn't help at al…
-
Hi, I'm interested in your work and appreciate the sharing of source code. I have some questions.
First, I run MT10-Conditioned task, I find that the time consumption is average 200s per epoch, meani…
-
**Issue for collecting ideas/research/plans related to how we want to shape our rewards.**
Current considerations:
- Does scale matter? (e.g. rewards 100 and 50 instead of 1 and 0.5)
- Tips from …
-
Hi Piotr,
Hope you are doing fine.
Based on the cognitive-agent example, I did some simple changes to the code using q-learning for the training. When running training the code gets stuck at env.…
-
Hi @dustinvtran
Can I add Tensorboard summaries during training when using the *logdir='log'* option? I tried several things, but nothing seemed to work. Here is my latest attempt:
```
sess = …
-
From Will Bans:
- Teachers can select a setting where all students automatically get a text to speech reading of the text. They then write it out.
- Students also see an "audio" button where they pl…
-
Line that triggers the error:
`q0, policy0, history0 = dyna_q(env, n=50, num_episodes=400, alpha=0.5, gamma=0.95)`
Python version: 3.10
Complete Error Message:
```
--------------------…
-
Niklas Lederer
Ferdinand Bubeck
Johannes Bubeck
Stefan Eckerle
Ausgesuchtes Spiel ist Flappy Bird