-
https://web.mit.edu/decima/content/sigcomm-2019.pdf
https://web.mit.edu/decima/
https://github.com/hongzimao/decima-sim
利用了强化学习 + GNN 做 DAG 任务的调度
-
I am trying to train a Reinforcement Learning agent using TF-Agent [TF-Agent DQN Tutorial](https://www.tensorflow.org/agents/tutorials/1_dqn_tutorial). In my application, I have 9 discrete actions (la…
-
I'm trying to port an OpenAI Gym Environment and use coach for the learn on top.
The tutorial currently reads (emphasis mine):
```
Adding an Environment
Adding your custom environments to…
-
Comment below with questions or thoughts about the reading for this week's workshop.
Please make your comments by Wednesday 11:59 PM, and upvote at least five of your peers' comments on Thursday pr…
-
In terms of functionality, the mid-term end goal is to achieve an offering of ML algorithms and pre-processing routines comparable to what is currently available in Python's [`scikit-learn`](https://s…
-
Hi! I am really interested in this project as the VIPER algorithm is relevant for my own research (which is also within safe and explainable RL). Therefore I would like to know, if you are interested …
-
If you're unfamiliar with eligibility traces, they basically unify temporal-difference learning with monte carlo methods -- essentially you hold a buffer in memory of an agent's experience and perform…
-
| Name | pdf | github |
| :--- | :--- | :---: |
| |[Motion Planning and Cooperative Manipulation for Mobile Robots With Dual Arms](https://ieeexplore.ieee.org/stamp/stamp.jsp?tp=&arnu…
yyf17 updated
2 years ago
-
https://intelligence.org/2013/08/11/what-is-agi/
https://pdfs.semanticscholar.org/72e1/4804f9d77ba002ab7f1d3e3a5e238a3a35ca.pdf
Ben Goertzel - Chief Scientist of financial prediction firm Aidyia H…
-
**Proceedings**
https://papers.nips.cc/book/advances-in-neural-information-processing-systems-30-2017
https://github.com/catpanda/NIPS_2017
**PaperLists (#Papers 679)**
https://www.dropbox.com/s…