Data Wrangling Info & Chat

wenyunie commented 10 months ago

Attribute Code	Timing	Type	Notes
attr1_1	signup/Time1	Self Attribute Importance Rating
attr1_2	followup/Time2	Self Attribute Importance Rating	Different Scales in different waves
attr1_3	followup2/ Time3	Self Attribute Importance Rating
attr2_1	signup/Time1	Perceived Opposite Sex Attribute Importance Rating
attr2_2	followup/Time2	Perceived Opposite Sex Attribute Importance Rating	Total Point Assignment
attr2_3	followup2/Time3	Perceived Opposite Sex Attribute Importance Rating
attr3_1	signup/Time1	Self Rating
attr3_2	followup/Time2	Self Rating
attr4_1	signup/Time1	Perceived Fellow Attribute Importance Rating	Different Scales in different waves
attr4_2	followup/Time2	Perceived Fellow Attribute Importance Rating	Total Point Assignment
attr4_3	followup2/Time3	Perceived Fellow Attribute Importance Rating
attr5_1	signup/Time1	Perceived Other Rating On Self
attr5_2	followup/Time2	Perceived Other Rating On Self
attr5_3	followup2/Time3	Perceived Other Rating On Self
attr7_2	followup/Time2	Self Attribute Importance Rating with reflection	Total Point Assignment
attr7_2	followup2/Time3	Self Attribute Importance Rating with reflection	Total Point Assignment
attr1_s	halfway event	Self Attribute Importance Rating with reflection
attr3_s	halfway event	Self Rating with reflection
attr	scoreboard	Rating on partners
attr_o	scoreboard	Partner Rating on this person

wenyunie commented 10 months ago

This dataset is far more fun than I thought it would be.

monazhu commented 10 months ago

Yeah it's actually quite a rich data set! This could quite literally be someone's PhD dissertation 😂

monazhu commented 10 months ago

@wenyunie, I was looking at the analysis file more closely and I realized we might have used the wrong data column for the analysis (because we would actually expect there to be a correlation between self vs other perceived attractiveness, even if we tend to over-rate our own attractiveness). I think rather than summarizing over the attr column (which is how their partners rated them), we should be using attr_o (how they were rated by their partners.

I quickly looked at the stats and graphs and these results were more in-line with what I would expect. I'll send along the output file I did quickly via slack so you can see what I mean

rorywhite200 commented 10 months ago

Great spot Mona! Thanks. On a tangential note, there was an interesting paper that talks about the weak correlation between self vs other perceived attractiveness and said something like if everyone had the same bias then it would be a perfect correlation. Apparently unattractive people completely overestimate their attractiveness while attractive people underestimate, which is fascinating. https://pubmed.ncbi.nlm.nih.gov/32157701/#&gid=article-figures&pid=fig-1-uid-0

SJOP-61-471-g001

wenyunie commented 10 months ago

@rorywhite200 This could be a good explanation of the weak correlation! Is this plot from our dataset or the study in the link? Please feel free to change anything about the data analysis part if you would like to include this plot/analysis in our report.

wenyunie commented 10 months ago

I saw the plot above is from the linked study, and just out of curiosity, I did the same thing with our dataset. The general trend holds, only that attractive people are accurate about themselves.

wenyunie commented 10 months ago

It seems pretty close to their study2 result:

rorywhite200 commented 10 months ago

Wow that is strikingly similar! Cool work! I wonder if it is too late to include this? I guess it goes a bit beyond our hypothesis. Maybe in one of the future weeks we will get an opportunity to expand our analysis

monazhu commented 10 months ago

Ouu this is super neat stuff! I really love that we're all digging deep into the data and finding cool gems 😄 I do agree with Rory though that a quantile analysis is beyond the scope of milestone 1. It would definitely be neat to have in the final-final-final report though, if we have the opportunity to include it since you did all this work!

monazhu commented 10 months ago

Also this is a total random aside, but a correlation of ~0.3 in social sciences is honestly considered quite big (especially since it's others' rating vs your own rating). If I saw a correlation of like 0.6+ in a social sciences paper I actually might start to get worried 😂

UBC-MDS / speed_dating_analysis

Data Wrangling Info & Chat #7