Closed andrewsu closed 1 year ago
I've set up the code to generate bar charts for single/multi select questions: example single select-
example multi select-
Let me know if I should make any changes to this format, or switch some from bar charts to other format like a pie chart. Also, I currently plan to add the percentages for each option in the legend.
Also, I was unsure how exactly to display the answers to the free text questions so please let me know on that.
Here is a PDF containing figures for all single/multi select questions in the sample data: All.pdf
This is looking really good!
Let me know if I should make any changes to this format, or switch some from bar charts to other format like a pie chart. Also, I currently plan to add the percentages for each option in the legend.
The bar charts I think are good. Just to make the visualization more concise, perhaps it could be adjusted to be something like this? I agree with adding exact numbers somewhere, either in the legend or as a label on the bar itself
Also, I was unsure how exactly to display the answers to the free text questions so please let me know on that.
Maybe something like this?
Here is a PDF containing figures for all single/multi select questions in the sample data: All.pdf
excellent start. Minor points I'm sure you've thought of...
I have an updated PDF now:
File: All.pdf
Also, @andrewsu when do you plan to update the sample data?
Looking good, thanks @rjawesome! A few notes:
Looks like the agree/disagree options changed -- compare Q10 to Q18
Seems to be because Somewhat Agree changes to Agree -- I guess I could treat these as equal?
Also some of the other issues seem to be due to different casing, so I will see how to make my method case insenstive
Looks like the agree/disagree options changed -- compare Q10 to Q18
Seems to be because Somewhat Agree changes to Agree -- I guess I could treat these as equal?
Yes, I think that would be fine...
Okay, I've generated new sample survey data in sample_survey_data_20230814.xlsx. A few notes:
Data
. L1 Reports
, L2 Reports
, L3 Reports
, and Institute
show what reports should be generated (and which should not) based on the sample data and our threshold to maintain anonymity. (These sheets will not be provided in the actual data file, and the analysis script should ignore these sheets.)Here, it says a report should be generated for "ACADEMIC RESEARCH" specified to each gender. However, one of the respondents for academic research in this sample data responded "prefer not to say". Therefore, my program interpreted this as "prefer not to say" being below the cutoff, and not generating gender specific reports.
Is this logic correct, or should "prefer not to say" be treated differently.
Great question. Yes, I was treating "prefer not to say" and blank values as special cases that can be ignored for the purposes of setting cutoffs for sub-groups. If a respondent says "prefer not to say", then their response just doesn't factor into any report based on gender identity.
Updated PDF: All.pdf This is based on the new sample data.
(I haven't changed it to use excel sheets yet, so currently I manually converted to TSV)
enough of the initial requirements are done that I'm going to close this issue as complete.
Notes
Department/Org Level 1
,Division/Org Level 2
, andStrategic Unit/Org Level 3
describe the levels of aggregation as we move up the org chartQ1:Gender Identity - Selected Choice
andQ5:Citizenship status
define the demographic groups. Reports for each one of these groups will be created when we meet the threshold to ensure anonymity only.Q6:How long have you been with Scripps Research?
represent the survey data to be summarized in a reportStrongly agree
,Agree
, ... ,Strongly Disagree
) should be shown as stacked bars. The vast majority of data should be in this format.Q13B:What methods do you use to communicate with your mentor? (Check all that apply) - Selected Choice
, example value isIn-person, one-on-one meetings,Group meetings,Email
.) These data should be shown as a bar chart showing the percentage of respondents who selected each response.Q54_1_TEXT:Is there something that you experienced working with previous mentors that you wish was also done with your current mentor? - Yes (please explain): - Text
). All answers should be presented in a simple text box.To-Do
This section will be broken out into individual tickets.
GRAD PROG - STUDENTS
sample_data.txt
to include more realistic counts for demographic groups to test inclusion/exclusion of reports as outlined in this analysis spreadsheet