q-learning Search Results

1000+ results
for q-learning

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

neka-nat/ddp-gym #2

unconverge

Hi! I'm learning DDP method recently and also upvoted your brilliant implementation. It seems like you are using the MPC version of ilqr? I change it into normal version but it does not converged any …

TianrongChen updated 5 years ago
1
promazo/Content-Team #90

USAA-Russ Major (Medium Post)

Raw File: https://drive.google.com/drive/folders/1fCWSFAtrvlxlwXISEFKsYOpSXyQFrYCB?usp=sharing Transcript: https://www.rev.com/transcript-editor/shared/zgZAdJPdVs8aCR8I5t9q4stRlSW5LOfj57qlO4VJ9Kzz7V5…

jaahmuhl updated 1 year ago
4
AngularJSUtah/meetups-slc-group #4

(New developer-friendly) ngUpgrade - Angular 2 for the Angul…

Learning AngularJS had been a struggle for me to learn at first--new vocabulary, $scope, digest cycle, controllers, the whole nine yards. Having been a developer for only half a year didn't help at al…

cwadrupldijjit updated 7 years ago
7
RchalYang/Soft-Module #5

Training efficiency

Hi, I'm interested in your work and appreciate the sharing of source code. I have some questions. First, I run MT10-Conditioned task, I find that the time consumption is average 200s per epoch, meani…

kevin-xuan updated 7 months ago
3
ivo-1/bomberman_rl #3

Reward Shaping

**Issue for collecting ideas/research/plans related to how we want to shape our rewards.** Current considerations: - Does scale matter? (e.g. rewards 100 and 50 instead of 1 and 0.5) - Tips from …

aileen-reichelt updated 2 years ago
2
tkn-tub/ns3-gym #53

ZMQError: Operation cannot be accomplished in current state

Hi Piotr, Hope you are doing fine. Based on the cognitive-agent example, I did some simple changes to the code using q-learning for the training. When running training the code gets stuck at env.…

sheila-janota updated 1 year ago
1
blei-lab/edward #478

Adding Tensorboard summaries during training

Hi @dustinvtran Can I add Tensorboard summaries during training when using the *logdir='log'* option? I tried several things, but nothing seemed to work. Here is my latest attempt: ``` sess = …

rfarouni updated 7 years ago
1
Empirical-org-Archive/Quill-Grammar-Ideas #41

Provide Text to Speech

From Will Bans: - Teachers can select a setting where all students automatically get a text to speech reading of the text. They then write it out. - Students also see an "audio" button where they pl…

petergault updated 9 years ago
2
vojtamolda/reinforcement-learning-an-introduction #17

Error in "Exercise 8.4*.ipynb" --> "TypeError: dispatcher fo…

Line that triggers the error: `q0, policy0, history0 = dyna_q(env, n=50, num_episodes=400, alpha=0.5, gamma=0.95)` Python version: 3.10 Complete Error Message: ``` --------------------…

MariosGkMeng updated 7 months ago
2
michael-spengler/aktuelle-data-science-entwicklungen-2-wwi19dsab #15

Reinforcement Learning for Games

Niklas Lederer Ferdinand Bubeck Johannes Bubeck Stefan Eckerle Ausgesuchtes Spiel ist Flappy Bird

stefaneckerle updated 2 years ago
3

上一页 1...94 95 96 97 98 99 100...100 下一页

1000+ results for q-learning

1000+ results
for q-learning