Open abrichr opened 4 months ago
@seanmcguire12 your assistance would be greatly appreciated!
@KrishPatel13 outcome evaluation for web apps will depend on finishing https://github.com/OpenAdaptAI/OpenAdapt/pull/364
Save a fixture with recording.task_description = "test: calculate 2x3"
that is just like the video currently on the website.
Test 1: Run the VanillaReplayStrategy
with empty instructions
(or give it instructions like replay the recording verbatim
). Use openadapt.window
to assert that the calculator display area contains the expected value 6
.
Test 2: Run the VanillaReplayStrategy
with instructions like calculate 9-8+7
. Use the same API to assert that the calculator display area contains the expected value 8
.
Parameterize the replay strategy and iterate over all of them. Produce a report with the results.
@seanmcguire12 please submit a PR with your work-in-progress 🙏
Feature request
We need to extend https://github.com/OpenAdaptAI/OpenAdapt/issues/314 to include some useful tests and generate an automated report.
This involves:
VanillaReplayStrategy
) and evaluate the outcome. Outcome evaluation can be implemented withWindowEvent
data.Motivation
Scientific rigor and reproducibility.