yding25 / GPT-Planner

Paper: Integrating Action Knowledge and LLMs for Task Planning and Situation Handling in Open Worlds
https://cowplanning.github.io/
MIT License

Questions on Your Dataset #2

Open bowen-upenn opened 5 months ago

bowen-upenn commented 5 months ago

Thank you for publishing such amazing work.

We noticed that your COWP dataset only includes textual descriptions of the tasks and possible situations. We are wondering whether the vision system was used only in your robot demonstration to detect situations, and not in any of the benchmarks or the result histograms for the 12 different tasks reported in the paper. Thank you!

yding25 commented 5 months ago

Thank you for appreciating our work.

COWP does not use a vision system; instead, it operates entirely in natural language. We assume a perfect Visual Question Answering (VQA) model capable of accurately describing scenes, and these descriptions serve as the situation context.
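For illustration only, here is a minimal sketch (not the authors' actual code) of what that looks like: a hand-written situation description stands in for the assumed perfect VQA output and is injected into a planning prompt. The function name, prompt wording, and example strings are all hypothetical.

```python
# Minimal illustrative sketch (not COWP's implementation): a textual
# situation description replaces a vision system's output and is
# inserted into the prompt as situation context for the LLM.

def build_situation_prompt(task: str, failed_step: str, situation: str) -> str:
    """Compose a natural-language prompt; names and wording are hypothetical."""
    return (
        f"Task: {task}\n"
        f"The robot attempted the step: {failed_step}\n"
        f"Observed situation (assumed perfect VQA description): {situation}\n"
        "Question: Can the robot still complete the task? "
        "If not, suggest how the plan should be adapted."
    )

if __name__ == "__main__":
    prompt = build_situation_prompt(
        task="serve a cup of coffee",
        failed_step="pick up the cup",
        situation="the cup on the table is broken",
    )
    # This prompt string would then be sent to an LLM for situation handling.
    print(prompt)
```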

We did attempt to develop a vision system. Unfortunately, few robotics simulation platforms support visualizing such scenarios (e.g., coffee spills or broken cups), and building one would also demand significant human effort.