sa-tre / satre-team

A project management repo for the SATRE project

Public release of underlying survey data & analysis #20

Open harisood opened 1 year ago

harisood commented 1 year ago

Summary of issue

At some point soon we need to release a public version of the survey data, accompanied by reproducible code for the analysis we have run on the responses.

This needs to be done carefully to ensure any public release is as safe as possible.

What needs to be done?

Who can help?

Issue checklist

manics commented 1 year ago

How does this sound for the initial release?

In future we can consider releasing anonymised individual responses so you can look for correlations, but this is higher risk and will take longer to assess or anonymise, so should be done in a future issue.

JimMadge commented 1 year ago

@manics's plan sounds good to me.

The only thing I would add is that, for the free text questions where there are clear categories, we produce counts as with the Likert questions. I would hope that the function in that notebook will do most of the heavy lifting.

harisood commented 1 year ago

Agreed. Do we want to set a date for when to have this ready by? And maybe assign responsibility?

manics commented 1 year ago

I can work on:

next week

I'm happy to look at the free text answers but that'll take longer, or someone else could look at them?

JimMadge commented 1 year ago

I wrote the free text -> categories function, so I'm happy to do that.

I would like to have someone else cast their eyes over that though, to make sure the code is doing what we think it should, and to make sure that the choices we make (e.g. which responses to drop, which words to count or not count, which synonyms to use) are reasonable.
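For anyone reviewing, here is a minimal sketch of the kind of free text -> categories step being discussed. It is not the function from the notebook: the names `SYNONYMS`, `DROP` and `categorise`, and the example values, are all hypothetical and only illustrate the choices that need checking (which responses are dropped, which synonyms are merged).

```python
from collections import Counter

# Hypothetical synonym map: each variant is folded into one canonical category.
SYNONYMS = {
    "uni": "university",
    "university": "university",
    "nhs trust": "nhs",
    "nhs": "nhs",
}

# Hypothetical set of responses that carry no information and are dropped.
DROP = {"", "n/a", "none"}


def categorise(responses, synonyms=SYNONYMS, drop=DROP):
    """Count free text responses per category after normalisation."""
    counts = Counter()
    for raw in responses:
        text = raw.strip().lower()  # normalise whitespace and case
        if text in drop:
            continue  # skip responses we have chosen to drop
        # Anything without a known synonym falls into a catch-all category.
        counts[synonyms.get(text, "other")] += 1
    return counts


example = ["Uni", "university ", "NHS Trust", "n/a", "charity"]
print(categorise(example))
```

Keeping the synonym map and the drop list as plain data makes those choices easy for a second person to read and challenge without having to follow the code itself.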

harisood commented 1 year ago

@manics if you have time to do free text as well that'd be huge, otherwise I can try and find some time!

manics commented 1 year ago

I've created a private spreadsheet for splitting roles and institutions into categories that can be made public.

harisood commented 1 year ago

Have you had a chance to look at free text?

manics commented 1 year ago

I haven't

manics commented 1 year ago

Following the last WP1 meeting, the plan is to create a new spreadsheet containing the raw (row-by-row) survey results, with the following changes:

The resulting spreadsheet will be checked for sensitive data, and if there are no problems it will be made public. The JISC online surveys JSON schema for the questionnaire will also be made available.

JimMadge commented 1 year ago

@manics Yes, that looks possible. An example is in the last cell here.

harisood commented 1 year ago

LGTM!