DigiRL-agent / digirl

Official repo for paper DigiRL: Training In-The-Wild Device-Control Agents with Autonomous Reinforcement Learning.
Apache License 2.0
246 stars 20 forks source link

Suggestions: Terminal support and OpenDevin intergration, further training #12

Closed James4Ever0 closed 3 months ago

James4Ever0 commented 3 months ago

OpenDevin has a problem about interacting with terminal. It would be nice to let this RL agent to operate in terminal and use Vim, even play VimGolf.

I haven't seen a single agent/model capable of doing that, but this one has the potential.


Further training involves automatic task generation and evaluation.

I can name a few for you:

Also you can check my project for more information.

BiEchi commented 3 months ago

Thanks for your interest in our work. We are currently developing better RL algorithms and integrating larger models for device control. Most of the ideas you mentioned are in scope of our near research - please keep an eye on our research!

BiEchi commented 3 months ago

Closing due to inactivity.

James4Ever0 commented 2 months ago

@BiEchi Developed a terminal interaction environment for agents, capable of converting all info from terminal into meaningful text, including cursor and styling information.

tmux_show_1

Terminal environment can be captured as image with cursor denoted in red:

vim_edit_tmux_screenshot