UBC-MDS / speed_dating_analysis

This repo is for a group project for analyzing speed dating data set for MDS-522
Other
0 stars 0 forks source link

Data Wrangling Info & Chat #7

Open wenyunie opened 10 months ago

wenyunie commented 10 months ago
Attribute Code Timing Type Notes
attr1_1 signup/Time1 Self Attribute Importance Rating
attr1_2 followup/Time2 Self Attribute Importance Rating Different Scales in different waves
attr1_3 followup2/ Time3 Self Attribute Importance Rating
attr2_1 signup/Time1 Perceived Opposite Sex Attribute Importance Rating
attr2_2 followup/Time2 Perceived Opposite Sex Attribute Importance Rating Total Point Assignment
attr2_3 followup2/Time3 Perceived Opposite Sex Attribute Importance Rating
attr3_1 signup/Time1 Self Rating
attr3_2 followup/Time2 Self Rating
attr4_1 signup/Time1 Perceived Fellow Attribute Importance Rating Different Scales in different waves
attr4_2 followup/Time2 Perceived Fellow Attribute Importance Rating Total Point Assignment
attr4_3 followup2/Time3 Perceived Fellow Attribute Importance Rating
attr5_1 signup/Time1 Perceived Other Rating On Self
attr5_2 followup/Time2 Perceived Other Rating On Self
attr5_3 followup2/Time3 Perceived Other Rating On Self
attr7_2 followup/Time2 Self Attribute Importance Rating with reflection Total Point Assignment
attr7_2 followup2/Time3 Self Attribute Importance Rating with reflection Total Point Assignment
attr1_s halfway event Self Attribute Importance Rating with reflection
attr3_s halfway event Self Rating with reflection
attr scoreboard Rating on partners
attr_o scoreboard Partner Rating on this person
wenyunie commented 10 months ago

This dataset is far more fun than I thought it would be.

monazhu commented 10 months ago

Yeah it's actually quite a rich data set! This could quite literally be someone's PhD dissertation 😂

monazhu commented 10 months ago

@wenyunie, I was looking at the analysis file more closely and I realized we might have used the wrong data column for the analysis (because we would actually expect there to be a correlation between self vs other perceived attractiveness, even if we tend to over-rate our own attractiveness). I think rather than summarizing over the attr column (which is how their partners rated them), we should be using attr_o (how they were rated by their partners.

I quickly looked at the stats and graphs and these results were more in-line with what I would expect. I'll send along the output file I did quickly via slack so you can see what I mean

rorywhite200 commented 10 months ago

Great spot Mona! Thanks. On a tangential note, there was an interesting paper that talks about the weak correlation between self vs other perceived attractiveness and said something like if everyone had the same bias then it would be a perfect correlation. Apparently unattractive people completely overestimate their attractiveness while attractive people underestimate, which is fascinating. https://pubmed.ncbi.nlm.nih.gov/32157701/#&gid=article-figures&pid=fig-1-uid-0

SJOP-61-471-g001

wenyunie commented 10 months ago

@rorywhite200 This could be a good explanation of the weak correlation! Is this plot from our dataset or the study in the link? Please feel free to change anything about the data analysis part if you would like to include this plot/analysis in our report.

wenyunie commented 10 months ago

I saw the plot above is from the linked study, and just out of curiosity, I did the same thing with our dataset. The general trend holds, only that attractive people are accurate about themselves.

image
wenyunie commented 10 months ago

It seems pretty close to their study2 result:

image
rorywhite200 commented 10 months ago

Wow that is strikingly similar! Cool work! I wonder if it is too late to include this? I guess it goes a bit beyond our hypothesis. Maybe in one of the future weeks we will get an opportunity to expand our analysis

monazhu commented 10 months ago

Ouu this is super neat stuff! I really love that we're all digging deep into the data and finding cool gems 😄 I do agree with Rory though that a quantile analysis is beyond the scope of milestone 1. It would definitely be neat to have in the final-final-final report though, if we have the opportunity to include it since you did all this work!

monazhu commented 10 months ago

Also this is a total random aside, but a correlation of ~0.3 in social sciences is honestly considered quite big (especially since it's others' rating vs your own rating). If I saw a correlation of like 0.6+ in a social sciences paper I actually might start to get worried 😂