-
## Bug description
Hi, I'm currently adapting the Inverse Reinforcement Learning algorithm to analyze the behavior of mice in our lab studies. For this, I have used the Maximum Causal Entropy (MCE) a…
-
**Submitting author:** @ritamraha (Ritam Raha)
**Repository:** https://github.com/rajarshi008/Scarlet
**Branch with paper.md** (empty if default branch): main
**Version:** v1.0.0
**Editor:** @adi3
**R…
-
Hello,
https://github.com/JayZeeDesign/microsoft-autogen-experiments/blob/main/content_agent.py
I hit a few infinite loops when running the script above with GPT-4.
- An agent, if I recall co…
-
## Motivation
This is somewhat related to https://github.com/pytorch/rl/issues/849. I realize TorchRL is quite new and I expect the documentation and examples will improve over time, but I believe th…
-
Paper title: Reflexion: Language Agents with Verbal Reinforcement Learning ([link to paper](https://arxiv.org/pdf/2303.11366.pdf))
Estimated time to complete the review: by 09/22/23
If you are new t…
-
Sik-Ho Tang. [Review: Representation Learning with Contrastive Predictive Coding (CPC/CPCv1)](https://sh-tsang.medium.com/review-representation-learning-with-contrastive-predictive-coding-cpc-cpcv1-8e…
-
### 🚀 Feature
Include AlphaZero in the library of available RL algorithms, possibly with a maskable-actions option.
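
To illustrate what a maskable-actions option could mean, here is a minimal sketch in plain Python. It assumes the policy produces one logit per action and that the environment supplies a boolean mask of legal actions. The function name `masked_softmax` and its arguments are hypothetical and do not come from any existing library.

```python
import math


def masked_softmax(logits, legal_mask):
    """Turn logits into probabilities, forcing illegal actions to zero.

    `logits` and `legal_mask` are hypothetical inputs: one float and one
    bool per action, where True marks a legal action.
    """
    if len(logits) != len(legal_mask):
        raise ValueError("logits and legal_mask must have the same length")
    if not any(legal_mask):
        raise ValueError("at least one action must be legal")

    # Subtract the largest legal logit for numerical stability.
    peak = max(x for x, ok in zip(logits, legal_mask) if ok)

    # Illegal actions get zero weight, so they can never be sampled.
    weights = [math.exp(x - peak) if ok else 0.0
               for x, ok in zip(logits, legal_mask)]
    total = sum(weights)
    return [w / total for w in weights]


# Three actions, the second one illegal in the current state.
probs = masked_softmax([2.0, 1.0, 0.5], [True, False, True])
```

The masked action receives probability zero and the remaining mass is renormalized over the legal actions. A tree-search method such as AlphaZero would apply the same kind of mask to its policy priors before expanding a node.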
### Motivation
I am a beginner in Reinforcement Learning, but I get some interesting …
-
### Description & Motivation
[TensorDict](https://pytorch.org/rl/tensordict/) is a dictionary-like class that inherits properties from tensors, such as indexing, shape operations, casting to device…
-
Hi everyone, thanks a lot for the great library!
Could someone please explain a bit more on how the chat model has been trained? More specifically, I am interested in how the input/output data has …
-
### Describe the bug
Importing Gymnasium results in an error:
```python
Traceback (most recent call last):
...
File "*", line 5, in <module>
import gymnasium as gym
File "*/venv/lib/python3.10/…