gao-g / prelude

Aligning LLM Agents by Learning Latent Preference from User Edits
https://arxiv.org/pdf/2404.15269
MIT License
17 stars 0 forks source link