-
Hello, thank you for this great work!
As I understand it, RecSim is built for use with TF 1 because, if I'm not mistaken, we must provide a session to an agent. Do you plan to adapt the interface for the…
-
I started to doubt whether it was actually learning, so I ran a quick test with only 300 training runs.
RL: Deep Q-Learning with experience replay {epsilon: 0, discount rate: 0.95}
NN: {optimizer: Adam, loss function: MSE, activation layer: ReLU}
Player se…
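The two pieces the config above names, experience replay and an epsilon-greedy policy, can be sketched in plain Python. Note that `epsilon: 0` means the agent never explores and always takes the greedy action, which alone can stall learning. The class and function names below are illustrative, not from the original code:

```python
import random
from collections import deque

class ReplayBuffer:
    """Fixed-size experience replay buffer for DQN-style training."""
    def __init__(self, capacity=10_000):
        self.buffer = deque(maxlen=capacity)

    def push(self, state, action, reward, next_state, done):
        self.buffer.append((state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform random sampling breaks temporal correlation between transitions.
        return random.sample(self.buffer, min(batch_size, len(self.buffer)))

def epsilon_greedy(q_values, epsilon):
    """Greedy action with probability 1 - epsilon, random action otherwise."""
    if random.random() < epsilon:
        return random.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda a: q_values[a])
```

With `epsilon=0` the second function reduces to a pure argmax over Q-values, so a small positive epsilon (often annealed from 1.0) is usually needed early in training.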
-
https://xiang578.com/post/reinforce-learnning-basic.html
Info — Slide download: Hung-yi Lee - Deep Reinforcement Learning. Course video: DRL Lecture 1: Policy Gradient (Review) - YouTube. Change Log 20191226: organized PPO-related ma…
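Since the notes cover policy gradients and PPO, the core of PPO's clipped surrogate objective is compact enough to sketch directly (a per-sample illustration, not the lecture's code):

```python
def ppo_clipped_objective(ratio, advantage, eps=0.2):
    """PPO-Clip surrogate for a single (state, action) sample.

    ratio = pi_new(a|s) / pi_old(a|s). Clipping the ratio to
    [1 - eps, 1 + eps] keeps one update from moving the policy
    too far from the policy that collected the data.
    """
    unclipped = ratio * advantage
    clipped = max(min(ratio, 1 + eps), 1 - eps) * advantage
    # Taking the min makes the objective a pessimistic (lower) bound.
    return min(unclipped, clipped)
```

For a positive advantage the clip caps how much the action's probability can be pushed up; for a negative advantage it caps how much it can be pushed down.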
-
Following the `pytorch_inference.ipynb` steps, I created an ipynb in `SageMaker/amazon-sagemaker-tsp-deep-rl` but got
`ModuleNotFoundError: No module named 'problems'`
Cloned `https://github.com/chaitjo…
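A `ModuleNotFoundError` like this usually means the notebook's working directory is not the directory that contains the `problems` package. One common workaround is to prepend the repo root to `sys.path` in the first notebook cell; the path below is an assumption about where the repo was cloned on the SageMaker instance, so adjust it to your layout:

```python
import os
import sys

# Hypothetical location: wherever the repo containing the
# `problems` package was cloned on the notebook instance.
repo_root = os.path.expanduser("~/SageMaker/amazon-sagemaker-tsp-deep-rl")

# Prepend so this repo wins over any same-named installed package.
if repo_root not in sys.path:
    sys.path.insert(0, repo_root)
```

Alternatively, starting Jupyter from the repo root (so the notebook's working directory contains `problems/`) avoids the path edit entirely.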
-
I have seen that https://gradio.app/ is used for the UIs on Hugging Face. @wang-boyu, have you looked into it, since it is listed as one of the possible frameworks in the GSoC wiki?
See also ht…
-
**Read the paper, answer these questions, and document the results on the wiki**
- What is the exact definition of the problem?
- What are the inputs and outputs (for each timestep vs. for each game)?
- How …
-
### 🐛 Bug
I am developing a custom feature extractor type (based on DeepSets) for SB3 and want to train and optimize it with sb3_zoo. For this I add the following to a custom config.py file:
```pyth…
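For context on the DeepSets idea: in SB3 such an extractor would subclass `BaseFeaturesExtractor`, but the core computation is independent of the framework, namely a per-element encoder, a symmetric pooling over the set, and a head on the pooled vector. A minimal NumPy sketch of that pattern (all names and shapes here are illustrative assumptions, not the issue author's code):

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical DeepSets extractor: phi per element, mean pool, then rho.
W_phi = rng.normal(size=(4, 8))   # per-element encoder weights (phi)
W_rho = rng.normal(size=(8, 16))  # post-pooling head weights (rho)

def deepsets_features(obs_set):
    """Map a (n_elements, 4) set observation to a (16,) feature vector.

    Mean pooling over axis 0 makes the output invariant to the
    ordering of the set elements, the defining DeepSets property.
    """
    h = np.maximum(obs_set @ W_phi, 0.0)    # phi with ReLU, shared per element
    pooled = h.mean(axis=0)                 # symmetric pooling over the set
    return np.maximum(pooled @ W_rho, 0.0)  # rho head
```

Because the pooling is symmetric, permuting the rows of `obs_set` leaves the features unchanged, which is exactly what makes this architecture suitable for set-valued observations.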
-
Hello Baris!
Great work on your master's thesis! I am doing similar work for my master's thesis, and I am using some of your work as inspiration!
We are using MuJoCo and Robosuite, and I am therefore not…
-
Hello, I have a quick question.
I know most RLHF setups use KL divergence.
https://github.com/nebuly-ai/nebullvm/blob/aad1c09ce20946294df3ec83569bad9496f58d0e/apps/accelerate/chatllama/chatllam…
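For reference, the usual role of the KL term in RLHF is to penalize the fine-tuned policy for drifting away from the reference (pretrained) model. A minimal sketch of that shaped reward, using the full KL over a discrete distribution (many implementations instead use a per-token log-ratio estimator; the names and `beta` value below are illustrative):

```python
import math

def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions over the same support."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def kl_shaped_reward(reward, p_policy, p_ref, beta=0.1):
    """RLHF-style shaped reward: task/preference reward minus a KL penalty
    that keeps the fine-tuned policy close to the reference model."""
    return reward - beta * kl_divergence(p_policy, p_ref)
```

When the policy matches the reference exactly the penalty vanishes, and it grows as the policy concentrates probability on tokens the reference considers unlikely.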
-
Hi author,
Thanks very much for your great work!
- I'm following the online training instructions in Deep_Learning_Readme.md, but when I finished the third step (starting the UDT side), I couldn't see…