learning-agent Search Results

1000+ results
for learning-agent

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

ExponentialDeepSpace/eds-archive #36

RL intro and more

## Prequest ![image](https://user-images.githubusercontent.com/1320252/123796714-fdc5b580-d917-11eb-9371-3e852a8a8051.png) - https://deepmind.com/learning-resources/-introduction-reinforcement-l…

NirViaje updated 2 years ago
7
Joshua-Ren/Neural_Iterated_Learning #3

Data is updated once every 4 iterations during PhaseB

See: https://github.com/Joshua-Ren/Neural_Iterated_Learning/blob/master/train.py#L222. It is common to update the data every iterations. Since I don't remember seeing it being discussed in the pape…

lavoiems updated 3 years ago
1
greenelab/deep-review #851

Precision Medicine as a control problem: Using simulation an…

https://arxiv.org/abs/1802.10440 > Sepsis is a life-threatening condition affecting one million people per year in the US in which dysregulation of the body's own immune system causes damage to its…

Gary-An updated 6 years ago
1
tensorflow/agents #699

Using Actor- Learner API and reverb for PPO agent

I am trying to adapt the SAC minitaur tutorial which uses the Actor-Learner API and reverb to work with the PPO agent. I changed the `tf_agent `from `sac_agent.SacAgent` to the `ppo_clip_agent.PPOCli…

sibyjackgrove updated 2 years ago
4
ikostrikov/pytorch-a2c-ppo-acktr-gail #126

Create config for all algorithms

Instead of using different default arguments for different algorithms, create config files and load arguments from there.

ikostrikov updated 6 years ago
6
jabrena/world-games-2017 #2

Brainstorming about technologies & topics

In order to be a democratic game, it is necessary that everyone participate in the brainstorming process. Everyone can write the different technologies and topics to discuss in the game. **Big Area…

jabrena updated 7 years ago
8
openai/spinningup #154

`mpi_statistics_scalar()` includes NaNs when computing mean …

The `spinningup/spinup/utils/mpi_tools/mpi_statistics_scalar` function computes the mean and variance in a manner that includes NaNs. As a consequence, the output of a simple agent learning will look …

RylanSchaeffer updated 5 years ago
1
montagejs/montage #1914

Loader.reel problem while learning Montage Studio. Forum see…

I have left a question at the [FORUM](http://forum.montagestudio.com/t/no-activity-in-over-a-year-beginning-learner-needs-help/125) but see no activity for more than a year. Where is the most recent …

CoolGames updated 6 years ago
2
mitodl/mitxpro #1910

Forgot password won't send email to user with partially-comp…

### Steps to Reproduce This one is a little sticky. I've been unable to reproduce it myself, but I've see it in production twice. (Lower priority for now, but I wanted to track it). While these…

briangrossman updated 3 years ago
2
cohense/RaceConversatioNmodel #4

Does It make sense for an individual to have a 0 probability…

cohense updated 7 years ago
2

上一页 1...84 85 86 87 88 89 90...100 下一页

1000+ results for learning-agent

1000+ results
for learning-agent