ProjectSidewalk / sidewalk-data-analysis

Holds all offline data analysis scripts for Project Sidewalk required for our forthcoming paper submission
3 stars 0 forks source link

Table: Updating table showing accuracy and behavioral statistics for user groups (section 4.6) #29

Closed manaswisaha closed 6 years ago

manaswisaha commented 6 years ago

Current table shows median accuracy measures and behavioral metrics: image

by user groups: image

We need to update the table to add:

  1. Visual Search time for different groups
  2. SDs for accuracy numbers
manaswisaha commented 6 years ago

Updated table with the added values would look something like this:

image

jonfroehlich commented 6 years ago
manaswisaha commented 6 years ago

Okay, I will change the font. It is currently on the right of the main number. However, it doesn’t fit in the space. On Fri, May 25, 2018 at 5:33 PM Jon Froehlich notifications@github.com wrote:

Please make it so that SD is to the right of the main number in each cell rather than below it. If it's below, you cannot vertically scan the numbers. Use a horizontally condensed font like Segoe Condensed or Arial Narrow as necessary.

— You are receiving this because you were assigned.

Reply to this email directly, view it on GitHub https://github.com/ProjectSidewalk/sidewalk-data-analysis/issues/29#issuecomment-392220772, or mute the thread https://github.com/notifications/unsubscribe-auth/ACvXgL7NqB1V_5bgW9JSb7GiFIOTfjUYks5t2KLsgaJpZM4UF6GY .

manaswisaha commented 6 years ago

Also, should recall go first?

The usual convention is to put precision first. But we can go with recall first since that is the most important metric for us.

And while I like your cell highlighting, perhaps you need to highlight all cells (e.g., using a gradient based on cell number value) rather than just specific cells.

I was using two color highlights: green (>80% accuracy) and red (<40% accuracy) to draw attention. Should I make a mono-color gradient?

Why are we using median rather than average?

We use median throughout the paper for the first number and then mention mean and SD in the parenthesis. That's what we selected because it was a better metric than mean when the data is skewed towards either sides.

Finally, is there a reason why we are splitting up All and Problem? Do you think this is important? Are we consistently doing this throughout the paper?

"All" is across all label types and "problem" is for identifying a problem type. We have been using both throughout the paper. We have been using "all" to report overall accuracy across all label types, and have been using "problem" similar to CHI13.

Does the value of doing this split outweigh the additional challenge of presenting the data?

I think reporting both is important in the label type vs accuracy section. However, in this section where we are comparing between user groups, we can just talk about their accuracy across all label types and drop "problem" type. Thoughts?

misaugstad commented 6 years ago

Finally, is there a reason why we are splitting up All and Problem? Do you think this is important? Are we consistently doing this throughout the paper?

I do think it is important to make the distinction. Curb ramp accuracy is very high across the board, and the large number of curb ramps end up causing curb ramp data to sort of crowd out the other data. This is a problem because, although identifying accessibility features is important, the really important thing we are trying to do is identify accessibility problems. So I do think it is worth showing both in most cases.