learning-agent Search Results

1000+ results for learning-agent


pocokhc/agent57 #8

Trying to run pendulum but getting errors

Hi, first of all, thanks for the great repository! I was trying to run the pendulum example but I get the following error; however, it seems like the code continues until testing 5 episodes. I'm not s…

apoorvabanubakode updated 3 years ago (1 comment)
FutureSharks/ml-finance #18

Why mixing 'closing prices' of 3 different companies?

First of all, I would like to thank you for organizing and sharing such a nice repo, Max Williams. I would appreciate it if you could answer one small question. Question - What is the point of u…

hellojinwoo updated 3 years ago (1 comment)
philtabor/Deep-Q-Learning-Paper-To-Code #14

Network is not learning when convolutional layers are applie…

Hey Phil! Thanks for the course. I'm really enjoying it so far. I've implemented the first real Deep Q Network, and it is not learning. Whenever I take off the convolutional layers and just use th…

DBaller updated 2 years ago (2 comments)
pfnet/pfrl #143

ACER - Examples on continuous action space

Hello, I am working on an RL project where I want to use the ACER algorithm on continuous action space problems (Pybullet environments), but I have difficulties implementing it using your framewor…

PKramek updated 3 years ago (16 comments)
Stable-Baselines-Team/stable-baselines3-contrib #224

Implementing "Sibling Rivalry" Method from "Keeping Your Dis…

### 🚀 Feature I propose the implementation of the "Sibling Rivalry" method, as outlined in the paper "Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing Shaped Rewards." Link to …

vladyskai updated 9 months ago (1 comment)
davaureli/MARL-in-SR-networks #1

Hello, do you have a related paper?

zhangmazi123321 updated 11 months ago (1 comment)
uchicago-computation-workshop/Winter2023 #3

2/09/2023: Mina Cikara

Comment below with a well-developed question or comment about the reading for this week's workshop. If you would really like to ask your question in person, please place two exclamation points befo…

GabeNicholson updated 1 year ago (71 comments)
webis-de/set-encoder #4

Tool learning for LLM

I am currently working on a problem to rerank tools (retrieving the appropriate tool for LLM), but the cross-encoder models are not converging. Here is an example: query: give me btc price tool: ge…

QuangTQV updated 2 months ago (5 comments)
ETEnterprises1/ET.ENT #146

Create:Let's start with the mission statement. Based on the…

Create:Let's start with the mission statement. Based on the name "Extraterrestrial Enterprises Crypto Banking Incorporated", I'll draft a possible mission statement: "At Extraterrestrial Enterprise…

ETEnterprises1 updated 6 days ago (1 comment)
PacktPublishing/Deep-Learning-with-PyTorch-1.x #4

Chapter 09

Value Iteration with Frozen Lake does not work. 1. It fails at env = gym.make('FrozenLake-v0'), saying to use v1 instead of v0. 2. Done. But when running the last code, it says: /opt/cond…

AlexanderHuels updated 8 months ago (1 comment)
