Implement user guidance

OpenAdaptAI / OpenAdapt

Open Source Generative Process Automation (i.e. Generative RPA). AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models

MIT License

960 stars 132 forks source link

When the replay strategy determines that the current Screenshot/WindowState is sufficiently different from the expected state in the recording, we would like to prompt the user to take over and demonstrate how to complete the task in that situation:

compare the current state to the expected state in the current recording.
if not found, compare the current state to states in other recordings with the same description.
if not found, pause replay
notify the user that they need to demonstrate how to continue
start a new recording (with the same description as the one being recorded) as soon as the user starts generating InputEvents
wait for the user to finish the demonstration (e.g. by clicking on the system tray icon)
restart replay from where we left off

Here are some questions and thoughts I had to address for some of these steps:

We can compare the screenshot captured during the replay to the screenshot captured during the recording(and subsequent screenshots) using computer vision tool such as OpenCV. The issue is how can we determine if its sufficiently different.
- Image comparison techniques such as thresholding and matching [1][2][3]. Thresholding involves setting a threshold value to determine if two images are similar or different, while matching involves comparing the features of two images using a distance calculation.
- Is this what you had in mind?
For the case when the user needs to demonstrate how to continue, we can use a desktop notification tool such as the WinToast library for Windows or the pynotify library for Linux(example below).
For waiting for the user to finish the demonstration (e.g. 10s of inactivity), we can use a Python timer such as the time module.

OpenAdaptAI / OpenAdapt

Implement user guidance #104