andrewsu / mentorship-survey-analysis

MIT License
0 stars 0 forks source link

initial analysis notes #1

Closed andrewsu closed 1 year ago

andrewsu commented 1 year ago

Notes

To-Do

This section will be broken out into individual tickets.

rjawesome commented 1 year ago

I've set up the code to generate bar charts for single/multi select questions: example single select- image

example multi select- image

Let me know if I should make any changes to this format, or switch some from bar charts to other format like a pie chart. Also, I currently plan to add the percentages for each option in the legend.

Also, I was unsure how exactly to display the answers to the free text questions so please let me know on that.

Here is a PDF containing figures for all single/multi select questions in the sample data: All.pdf

andrewsu commented 1 year ago

This is looking really good!

Let me know if I should make any changes to this format, or switch some from bar charts to other format like a pie chart. Also, I currently plan to add the percentages for each option in the legend.

The bar charts I think are good. Just to make the visualization more concise, perhaps it could be adjusted to be something like this? I agree with adding exact numbers somewhere, either in the legend or as a label on the bar itself

image

Also, I was unsure how exactly to display the answers to the free text questions so please let me know on that.

Maybe something like this?

image

Here is a PDF containing figures for all single/multi select questions in the sample data: All.pdf

excellent start. Minor points I'm sure you've thought of...

rjawesome commented 1 year ago

I have an updated PDF now:

File: All.pdf

Also, @andrewsu when do you plan to update the sample data?

andrewsu commented 1 year ago

Looking good, thanks @rjawesome! A few notes:

rjawesome commented 1 year ago

Looks like the agree/disagree options changed -- compare Q10 to Q18

Seems to be because Somewhat Agree changes to Agree -- I guess I could treat these as equal?

Also some of the other issues seem to be due to different casing, so I will see how to make my method case insenstive

andrewsu commented 1 year ago

Looks like the agree/disagree options changed -- compare Q10 to Q18

Seems to be because Somewhat Agree changes to Agree -- I guess I could treat these as equal?

Yes, I think that would be fine...

andrewsu commented 1 year ago

Okay, I've generated new sample survey data in sample_survey_data_20230814.xlsx. A few notes:

rjawesome commented 1 year ago

image Here, it says a report should be generated for "ACADEMIC RESEARCH" specified to each gender. However, one of the respondents for academic research in this sample data responded "prefer not to say". Therefore, my program interpreted this as "prefer not to say" being below the cutoff, and not generating gender specific reports.

Is this logic correct, or should "prefer not to say" be treated differently.

andrewsu commented 1 year ago

Great question. Yes, I was treating "prefer not to say" and blank values as special cases that can be ignored for the purposes of setting cutoffs for sub-groups. If a respondent says "prefer not to say", then their response just doesn't factor into any report based on gender identity.

rjawesome commented 1 year ago

Updated PDF: All.pdf This is based on the new sample data.

(I haven't changed it to use excel sheets yet, so currently I manually converted to TSV)

andrewsu commented 1 year ago

enough of the initial requirements are done that I'm going to close this issue as complete.