mnotgod96 / AppAgent

AppAgent: Multimodal Agents as Smartphone Users, an LLM-based multimodal agent framework designed to operate smartphone apps.
https://appagent-official.github.io/
MIT License
4.97k stars 538 forks source link

输入stop的后一直显示: Waiting for GPT-4V to generate documentation for the element #62

Open zhelloworld123456 opened 7 months ago

zhelloworld123456 commented 7 months ago

Which element do you want to tap? Choose a numeric tag from 1 to 13:

1 Choose one of the following actions you want to perform on the current screen: tap, text, long press, swipe, stop

stop Demonstration phase completed. 5 steps were recorded.

Warning! No module named 'soundfile' Warning! No module named 'tensorflow' Starting to generate documentations for the app AIXIXIHAHAHAPP based on the demo demo_AIXIXIHAHAHAPP_2024-03-21_14-06-44

Waiting for GPT-4V to generate documentation for the element android.widget.LinearLayout_462_97_com.android.calendar.id_extend_toolbar_title_parent_0