-
Justify IntradayObserver or remove it from the code (daily timesteps also work with the default observer)
-
# IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures #
- Authors: Lasse Espeholt, Hubert Soyer, Remi Munos, Karen Simonyan, Volodymyr Mnih, Tom Ward, Yotam Dor…
-
TicTacToe has only a few thousand states, but for most applications the number of states will be more than will fit in memory. In those cases, some sort of approximation, like neural nets, must be used.…
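The simplest instance of this idea is a linear value-function approximator trained with TD(0): instead of a table with one entry per state, the agent learns a weight vector over board features. A minimal NumPy sketch (the board encoding, learning rate, and class names here are illustrative choices, not taken from the text above):

```python
import numpy as np

def features(board):
    """Encode a 3x3 board (entries -1, 0, +1) as a flat feature vector."""
    return np.asarray(board, dtype=float).ravel()

class LinearV:
    """Linear approximation of the state-value function V(s) ~ w . phi(s)."""

    def __init__(self, n_features, lr=0.1):
        self.w = np.zeros(n_features)
        self.lr = lr

    def value(self, board):
        return float(self.w @ features(board))

    def td0_update(self, board, reward, next_board, gamma=0.99):
        # TD(0): move V(s) toward the one-step target r + gamma * V(s').
        target = reward + gamma * self.value(next_board)
        error = target - self.value(board)
        self.w += self.lr * error * features(board)
        return error

v = LinearV(n_features=9)
s  = [[1, 0, 0], [0, -1, 0], [0, 0, 0]]
s2 = [[1, 0, 0], [0, -1, 0], [0, 0, 1]]
err = v.td0_update(s, reward=1.0, next_board=s2)
```

The weight vector has a fixed size (here 9) regardless of how many distinct states exist, which is what makes the approach viable when a table would not fit in memory.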
-
I am trying to train a Reinforcement Learning agent using TF-Agents [TF-Agents DQN Tutorial](https://www.tensorflow.org/agents/tutorials/1_dqn_tutorial). In my application, I have 9 discrete actions (la…
-
```md
---
prompt
Deep learning vs. supervised learning vs. unsupervised learning vs. reinforcement learning vs. transfer learning:
1. What are the differences?
2. What are some example models and how are they used?
You are an ML master; give me detailed explanations and guidance, and include the English terms for technical vocabulary.
If there is code, set it apart in separate blocks.
Use Traditional Chinese.
If my question is unclear, you may rephrase it.
---
```
# ML Learning …
-
Hello TAs,
For the Q-learning case, it's intuitive to choose the best action, the one that maximizes Q(s, a) over all possible a.
But the TD(0) agent's V function only takes states as input, V(s).
How w…
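One standard resolution is that a state-value agent selects actions by a one-step lookahead, argmax_a [r(s, a) + γ·V(s')], which requires a model of the environment. A toy sketch (the three-state chain, rewards, and V values below are made up purely for illustration):

```python
# Greedy action selection from a state-value function V(s): without Q(s, a),
# the agent looks one step ahead with a (hypothetical, deterministic) model
# and picks argmax_a [ r(s, a) + gamma * V(next_state) ].

V = {"left": 0.0, "mid": 0.5, "right": 1.0}  # illustrative learned values

def model(state, action):
    """Toy deterministic model: returns (reward, next_state)."""
    order = ["left", "mid", "right"]
    i = order.index(state)
    j = min(i + 1, 2) if action == "go_right" else max(i - 1, 0)
    reward = 1.0 if order[j] == "right" else 0.0
    return reward, order[j]

def greedy_action(state, actions=("go_left", "go_right"), gamma=0.9):
    def backup(a):
        r, s2 = model(state, a)
        return r + gamma * V[s2]
    return max(actions, key=backup)
```

This is exactly why Q-learning is called model-free while greedy control from V(s) is not: the max over actions needs the transition dynamics, which Q(s, a) bakes in.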
-
In Chrome's implementation of declarativeNetRequest, we have an [explicit list](https://source.chromium.org/chromium/chromium/src/+/main:extensions/browser/api/declarative_net_request/constants.h;l=25…
-
[Dataset](https://schema.org/Dataset) is pretty vague; it can cover anything from .zip files of .wavs of social science interviews to application-specific on-disk file formats, and so on. In theory we cou…
-
I retrained your model using the default hyperparameters in run.py, but my results do not match the reported results; the score is still too low after 2000 episodes. Could you please give me any…
-
### Description
Right now, PettingZoo serves as something akin to a multi-agent version of Gym, with support from around a dozen multi-agent learning libraries and 25+ custom environments, which mak…