Open kasmith opened 7 years ago
Do you think it makes sense to not give people feedback in terms of being able to watch the ball move when they try and experiment, but rather just immediate feedback on whether or not it went into the goal? That takes care of people simulating as they watch the ball move, ensures they don't get any additional information from the experiment regarding gravity/friction etc (although I guess we assume these learning curves have already asymptoted), as well as maintains the most similarity to the "only experiments" control condition where they just "function learn" which positions work.
I had a few other questions regarding the set-ups:
I think we could just increase the costs per second for the experiment; this way we also account for the fact that some experiments take longer than others. I agree that they can potentially always simulate and act at the same time, but I don't know how to avoid that. Thus, I'd rather make it a feature of the task from the start. Basically, what I'm saying is that I vote against binary feedback...;-)
I think it shouldn't influence the model too much if the distribution is binary but will have to check of course. I guess a good approach right now could be that we come up with interesting levels (maybe more than we would actually test) and then run the algorithm over those.
Meeting notes from 10/11/17 discussion
Experiment settings / changes:
Stimulus creation:
Model considerations:
Model API consists of three functions:
ground_truth_simulation(angle, trial): Outputs binary success value, travel distance (in px)
noisy_simulation(angle, trial, noise_parameters): Outputs binary success value, simulation travel distance (px)
multi_noisy_simulation(angle, trial, noise_parameters, n): Outputs probability of success over n trials, average simulation travel distance (should we split by success/failure?)
Next steps:
Whiteboard picture:
I've updated the experiment to account for the changes we discussed last week, including:
1) The score now decreases at 6/s instead of 10/s -- this seems much more reasonable 2) You cannot launch the ball (for either the experiment or action) unless your mouse is within 250px of the ball center -- this is to avoid hovering over a goal 3) Once the points run down, it is no longer an automatic loss; instead you get one chance to shoot the ball to avoid losing points. To make this clear to participants, there is now a gold outline around the table whenever the actual shot is active
This issue is for the discussion of the first experiment in which we demonstrate simulation & action change based on changes in costs of thinking (time) or experiments. We have a basic framework in the "experiment_development" branch with a number of moving parts that we need to decide on. These decisions come in two parts: structural questions and parameter settings.
Structural questions:
Parameter settings: