How is the reward model designed?

mnotgod96 / AppAgent

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.

https://appagent-official.github.io/

MIT License

4.97k stars 538 forks

How is the reward model designed? #69

Open EthanLeo-LYX opened 6 months ago

EthanLeo-LYX commented 6 months ago

In Section 4.2 (Reward) of the paper, you say that you developed a reward model that assesses performance by calculating the similarity between the final UI page and the target UI page. How is the reward model designed and trained? And will it be released?

csdaa commented 6 months ago

This project seems to have been abandoned. Consider switching to a project from overseas developers instead.

csdaa commented 6 months ago

Here is one of their papers, which might help you: https://arxiv.org/pdf/2312.13771.pdf