-
Hi, first of all, thanks for the great repository!
I was trying to run the pendulum example but get the following error, however, it seems like the code continues till testing 5 episodes. I'm not s…
-
First of all, I would like to thank you for organizing and sharing such a nice repo, Max williams. I would appreciate it if you can answer one small question.
Question
- What is the point of u…
-
Hey Phil! Thanks for the course. I'm really enjoying it so far.
I've implemented the first real Deep Q Network, and it is not learning. Whenever I take off the convolutional layers and just use th…
-
Hello,
I am working on an RL project, where I want to use the ACER algorithm on continuous action space problems (Pybullet environments), but I have difficulties implementing it using Your framewor…
-
### 🚀 Feature
I propose the implementation of the "Sibling Rivalry" method, as outlined in the paper "Keeping Your Distance: Solving Sparse Reward Tasks Using Self-Balancing Shaped Rewards." Link to …
-
-
Comment below with a well-developed question or comment about the reading for this week's workshop.
If you would really like to ask your question in person, please place two exclamation points befo…
-
I am currently working on a problem to rerank tools (retrieving the appropriate tool for LLM), but the cross-encoder models are not converging.
Here is an example:
query: give me btc price
tool: ge…
-
Create:Let's start with the mission statement. Based on the name "Extraterrestrial Enterprises Crypto Banking Incorporated", I'll draft a possible mission statement:
"At Extraterrestrial Enterprise…
-
Value Iteration With Frozen Lake does not work.
1. It run into failure: env = gym.make('FrozenLake-v0'). It says to use v1 instead of v0.
2. Done. But when running last code, it says:
/opt/cond…