sa-tre / satre-team

A project management repo for the SATRE project
4 stars 1 forks source link

Questionnaire summary: Results from free text (answers from a definable set) #18

Open harisood opened 1 year ago

harisood commented 1 year ago

Summary of issue

Once the process for handling questions is decided (#12), we need to write reproducible code that allows us to get all the outputs from these questions that can be used to open issues for the specification repo

- 7.a Which non-desktop interfaces are important to you?
- 8.a Which programming languages are important to you?
- 9.a Which repositories are important to you?
- 10.a Which commercially licensed software is important to you?
- 23.a Are there sensitivity systems that you think are important or use?

What needs to be done?

Who can help?

Issue checklist

harisood commented 1 year ago

@JimMadge @craddm updated list of all questions your guys code will cover!

We also need to think how to incorporate results from related free text questions in #17 - e.g. the MCQ for how important programming language support is, and the free text laying out the languages a TRE should support

JimMadge commented 1 year ago

Progress today: https://github.com/sa-tre/feature_survey/commit/fb3259163ee97c7e99d76b19cf34f01bd57d4afa

generates table like

language votes
python 50
r 50
stata 9
sql 8
c# 4

known issues,

  1. Double counting within one answer (e.g. someone says "Python, Python, Python, I love Python"
  2. This is not natural language processing (e.g. someone says "Absolutely not Python, it is bad")
  3. Not currently % of respsones. Should that be, % of total responses, % of responses to this question, % of responses we didn't drop?
JimMadge commented 1 year ago

At https://github.com/sa-tre/feature_survey/commit/70f698c5603406ef1bf3a5e22b25d62a1bc39d14

Percentage of accepted responses,

language count percentage
python 50 79
r 50 79
stata 9 14
sql 8 13
c# 4 6
JimMadge commented 1 year ago

As of https://github.com/sa-tre/feature_survey/commit/fe2d4a666ed0d61fb63022a535f7f0a5a9657d24

Multiple counts from a single answer no longer affect the output.