OpenAdaptAI / OpenAdapt

AI-First Process Automation with Large ([Language (LLMs) / Action (LAMs) / Multimodal (LMMs)] / Visual Language (VLMs)) Models
https://www.OpenAdapt.AI
MIT License
735 stars 95 forks source link

WIP:avoid unnecessary segmentation #783

Open Animesh404 opened 1 week ago

Animesh404 commented 1 week ago

What kind of change does this PR introduce?

attempt for issue #614 to extract the portion of active window that is different and combine with similar image

Summary

Trying to extract the portion of the active window that is different (i.e. greater than some tolerance) from the window retrieved, this can be used later to segment + describe this portion rather than the full thing.

Checklist

How can your code be run and tested?

try to run experiments/visualizing_segments.py

abrichr commented 1 week ago

That's a great start!

The next steps are: