THUDM / WebRL

Building Open LLM Web Agents with Self-Evolving Online Curriculum RL
197 stars 9 forks source link