-
**Feature Request: LangGraph Integration for Adaptive Agent Workflows in PufferLib**
**Objective**: Expand PufferLib’s capabilities by integrating LangChain, TRL (Transformers Reinforcement Learnin…
-
- [ ] [system-2-research/README.md at main · open-thought/system-2-research](https://github.com/open-thought/system-2-research/blob/main/README.md?plain=1)
# OpenThought - System 2 Research Links
He…
-
Develop a hybrid attack decision-making system for agents that combines a Q-learning neural network (QNN) and rule-based constraints. This system will allow agents to dynamically decide between aggres…
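
One way to read "Q-learning plus rule-based constraints" is as action masking: the rules veto some actions, and the agent takes the highest-valued action among those that remain. The sketch below assumes that reading. The action names, the health threshold, and the helpers `rule_mask` and `select_action` are hypothetical and not taken from the request, and the Q-values are hard-coded instead of coming from a trained network.

```python
from typing import List, Sequence

# Hypothetical action set, used only for illustration.
ACTIONS = ("attack", "defend", "retreat")


def rule_mask(health: float) -> List[bool]:
    """Illustrative rule: forbid attacking when health is below 20%."""
    return [health >= 0.2, True, True]


def select_action(q_values: Sequence[float], allowed: Sequence[bool]) -> int:
    """Return the index of the highest-valued action that the rules permit."""
    if len(q_values) != len(allowed):
        raise ValueError("q_values and allowed must have the same length")
    candidates = [i for i, ok in enumerate(allowed) if ok]
    if not candidates:
        raise ValueError("the rules forbid every action")
    return max(candidates, key=lambda i: q_values[i])


# Stand-in Q-values; a real agent would get these from its Q-network.
q = [0.9, 0.4, 0.1]
choice_healthy = ACTIONS[select_action(q, rule_mask(0.8))]
choice_weak = ACTIONS[select_action(q, rule_mask(0.1))]
```

Masking at selection time keeps the learned values untouched, so the rules can be changed without retraining. Whether the request intends this or a different combination is not stated in the excerpt.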
-
I haven't found the official code for AD, but there are some newer works based on it, such as DPT, whose authors have released their code. I'm not sure whether I missed AD's code. Could you please provide…
-
For example, when the title in a .tex file has one of the following formats:
- `\title{FlexiTex: Enhancing Texture Generation with Visual Guidance}`
- `\title[DeepMimic: Example-Guided Deep Reinforcement Learning o…
-
### Proposal
Add support for environments with `Graph` observation spaces in `AsyncVectorEnv` in Gymnasium. Currently, `AsyncVectorEnv` assumes observations can be stacked in a typical array-like for…
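
The stacking problem can be illustrated without Gymnasium itself. The sketch below uses a hypothetical `GraphObs` dataclass as a stand-in for a graph observation and shows why array-style batching breaks when sub-environments return graphs of different sizes, while a tuple of per-environment observations does not. It does not reproduce `AsyncVectorEnv` internals; `stack_nodes` and `batch_as_tuple` are illustrative names only.

```python
from dataclasses import dataclass
from typing import List, Sequence, Tuple


@dataclass
class GraphObs:
    """Hypothetical stand-in for one graph observation."""
    nodes: List[List[float]]            # one feature vector per node
    edge_links: List[Tuple[int, int]]   # (source, target) node indices


def stack_nodes(observations: Sequence[GraphObs]) -> List[List[List[float]]]:
    """Array-style batching: valid only if every graph has the same node count."""
    counts = {len(obs.nodes) for obs in observations}
    if len(counts) > 1:
        raise ValueError(f"cannot stack graphs with node counts {sorted(counts)}")
    return [obs.nodes for obs in observations]


def batch_as_tuple(observations: Sequence[GraphObs]) -> Tuple[GraphObs, ...]:
    """Size-agnostic batching: keep one observation per sub-environment."""
    return tuple(observations)


small = GraphObs(nodes=[[0.0], [1.0]], edge_links=[(0, 1)])
large = GraphObs(nodes=[[0.0], [1.0], [2.0]], edge_links=[(0, 1), (1, 2)])
batch = batch_as_tuple([small, large])
```

Calling `stack_nodes([small, large])` raises `ValueError`, which mirrors the assumption the proposal wants to relax.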
-
### Description
The project aims to develop a reinforcement learning (RL) agent to optimize waste collection in a simulated environment, minimizing overflow events and improving efficiency.
Environm…
-
### Description
It would be really handy to have the ability to log instantaneous events, especially in reinforcement learning projects.
For example, `wandb.log({"Events": "Experience Replay starts"…
-
Dear Gioele Scaletta,
I hope this message finds you well.
My name is Mohadese Rezaei, and I recently came across your master's thesis titled "Deep Reinforcement Learning for Portfolio Optimizatio…
-
### Idea Contribution
- [X] I have read all the feature request issues.
- [X] I'm interested in working on this issue
- [X] I'm part of GSSOC organization
### Explain feature request
Adding proper …