mayacakmak / se2

Control interfaces for manipulating SE2 configurations
BSD 2-Clause "Simplified" License
1 stars 0 forks source link

Study 2 data processing, analysis, figures #20

Open mayacakmak opened 3 years ago

mayacakmak commented 3 years ago

@KaviMD Creating a separate thread for study 2 related data stuff. The first MTurk batch is not running so soon we should have some data to look at.

I know you've updated the data model a bit in on Firebase. Is the data processing code under se3/ also updated? What needs to be done there?

Data we'll want to extract from the logs:

Other ideas?

kavidey commented 3 years ago

Currently, there is no code for data processing under se3/. Most of the code for data processing under se2/ should work for se3 though.

The big things that need to be changed are where data is being downloaded from on firebase, and where the action list for each cycle is. Previously this was users/{uid}/sessions/{sid}/cycles/{cid}/events, for se3, the cycle metadata (target pose, total time, number of IK arm resets, etc.) is stored under se3/users/{uid}/sessions/{sid}/cycles/{cid}/ and the action lists for each cycle (clicks, view changes, etc.) is stored under se3/cycles/{cid}/.

I am working on updates to the SE2 data processing code right now, but I can start on SE3 when that is finished.

mayacakmak commented 3 years ago

Sounds good! we should have the first batch data in the databse for testing SE3 data processing.

kavidey commented 3 years ago

As of right now, all of the data processing scripts in se3/ except for process_data.py should be working with the SE3 data (there isn't enough data to get in-depth results right now. The user filtering was also disabled)

One thing that I'm not sure about is what the best way to calculate the distance between the pose of the target and the starting pose of the EE (position could be euclidean distance, but I don't know what metric to use for rotation). I'm also not sure of what the best way to calculate how "flexible" a target is. This could just be the value of the thinnest axis, but there might be a more advanced way to do it (does this change for box vs cylinder targets?).

Some notes:

Right now, the .csv file generated by json_to_csv.py should have:

Something else that might be interesting to add (but is not there right now) is the amount of time spent on each view, the number of times the view was switched, or the total number of clicks for each view

mayacakmak commented 3 years ago

Nice! @KaviMD I don't think we need to worry about numerically representing distance and size/flex of targets for Study2 (i.e. we won't try to recreate the scatter plots). Since it's only 5 tasks we can visually show what they are--what matters more directly is how much the gripper has to move/rotate to grasp it.. we asked participants to grasp a certain way and I think you set up the tasks nicely so the first two only requires translation, the latter three require rotation in increasing number of dimensions. The issue is we're not really enforcing this and I noticed it is sometimes possible to grasp the horizontal objects from the side. So basically we won't make any strong statements about how "far" "difficult" the tasks are, we just say what they are. But then perhaps it might be useful to report how much people changed position and how much they changed rotation for each task -- I'm pretty sure needing to change rotation adds to completion time/leads to more resets/etc.

And yes, it would be great to add:

kavidey commented 3 years ago

json_to_csv.py is updated with the new view metrics. process_data.py is integrated into the new se3 data, but most of the graphs that it was originally meant to create rely on distance metrics that we don't currently have calculated for se3

I uploaded .csv files generated from all the se3 data we have right now to the Google Drive folder: Cycle Data: https://drive.google.com/file/d/1Q3ssA1NFlrm_nSFDvnzNUTRp75VysM3R/view?usp=sharing Questionnaire Data: https://drive.google.com/file/d/1Ht0VgF-SFtJHcGT5f3pcmsIcxoRL-KBC/view?usp=sharing

mayacakmak commented 3 years ago

I'll go ahead and release more HITs now. Inspecting the questionnaire data, the quality seems much higher b/c I restricted 'location' to US only.