reinforcement-learning-environments Search Results

803 results
for reinforcement-learning-environments


dyweb/papers-notebook #181

Learning Scheduling Algorithms for Data Processing Clusters

https://web.mit.edu/decima/content/sigcomm-2019.pdf https://web.mit.edu/decima/ https://github.com/hongzimao/decima-sim Uses reinforcement learning + GNN to schedule DAG jobs

gaocegege updated 5 years ago
4
tensorflow/agents #454

Can tf.agent policy return probability vector for all action…

I am trying to train a Reinforcement Learning agent using TF-Agent [TF-Agent DQN Tutorial](https://www.tensorflow.org/agents/tutorials/1_dqn_tutorial). In my application, I have 9 discrete actions (la…

bing-zhao updated 1 year ago
5
IntelLabs/coach #383

[Question] OpenAI Gym Tutorial

I'm trying to port an OpenAI Gym Environment and use coach for the learn on top. The tutorial currently reads (emphasis mine): ``` Adding an Environment Adding your custom environments to…

saltypeanuts updated 5 years ago
2
uchicago-computation-workshop/Fall2020 #7

11/5: Alison Gopnik

Comment below with questions or thoughts about the reading for this week's workshop. Please make your comments by Wednesday 11:59 PM, and upvote at least five of your peers' comments on Thursday pr…

ehuppert updated 3 years ago
109
rust-ml/linfa #7

Roadmap

In terms of functionality, the mid-term end goal is to achieve an offering of ML algorithms and pre-processing routines comparable to what is currently available in Python's [`scikit-learn`](https://s…

LukeMathWalker updated 3 months ago
81
Safe-RL-Team/viper-verifiable-rl-impl #1

Interested in contributions?

Hi! I am really interested in this project as the VIPER algorithm is relevant for my own research (which is also within safe and explainable RL). Therefore I would like to know, if you are interested …

andreashhpetersen updated 1 year ago
3
sentenai/reinforce #12

Add eligibility trace variants in algorithms

If you're unfamiliar with eligibility traces, they basically unify temporal-difference learning with Monte Carlo methods -- essentially you hold a buffer in memory of an agent's experience and perform…

stites updated 6 years ago
6
yyf17/NavigationProject #5

References

| Name | pdf | github | | :--- | :--- | :---: | | |[Motion Planning and Cooperative Manipulation for Mobile Robots With Dual Arms](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnu…

yyf17 updated 2 years ago
3
jellAIfish/jellyfish #10

Artificial General Intelligence: Concept, State of the Art, …

https://intelligence.org/2013/08/11/what-is-agi/ https://pdfs.semanticscholar.org/72e1/4804f9d77ba002ab7f1d3e3a5e238a3a35ca.pdf Ben Goertzel - Chief Scientist of financial prediction firm Aidyia H…

markroxor updated 7 years ago
12
unizard/AwesomeArxiv #86

[2017.11.29] Vision In NIPS2017

**Proceedings** https://papers.nips.cc/book/advances-in-neural-information-processing-systems-30-2017 https://github.com/catpanda/NIPS_2017 **PaperLists (#Papers 679)** https://www.dropbox.com/s…

unizard updated 6 years ago
3
