Closed morchickit closed 7 years ago
The data characteristics document can be found at: https://github.com/okfn/opendatasurvey/issues/905
NOTES TO INCLUDE IN FINAL REPORT (AND METHODOLOGY PAGE?)
The data characteristics are our unit of analysis. In theory all survey questions need to apply to the entire list of characteristics. Weather forecasts should be open for various types of forecasts.
Problem: Government publishes data in different files. One file is open, but does not have relevant granularity (does not meet all characteristics). Another file is closed, but contains relevant information. Reviewers find different datasets, some are openly licensed, others are not. Only some are in machine-readable formats. What should they do? A systematic approach is needed how we select our unit of analysis from different datasets.
Solution: In these cases the review can be based on two approaches
1) reviewers use a reference dataset that contains all relevant characteristics and answer questions B2 and B4 until B8 (B9 should be commented on) with reference to this dataset. The main point is that the dataset contains all data characteristics. If reviewers have to choose between two or more similar datasets, choose the one that is most applicable to the questions B2, B4-B8. They document their choice in the comment section. In case that something is unclear, reviewers consult the forum.
2) reviewers cannot find a reference dataset, because the data is split into various datasets. In this case, they refer questions B2 and B4-B8 to all datasets. GODI assumes that these datasets are representative for the data characteristics and in the end we want to assess how open this key information is. Example: If one dataset displays votes on bills and is openly licensed, but another one contains transcripts of debates in parliament and is not openly licensed, then the answer to question B7 would be "No".
Add a section on how to read our results. This is necessary so people understand our icons and can better interpret how they relate to one another. Prior discussions on the forum show, that people see logical links between our questions, where there are none, necessarily. For example some question how data can be in bulk if it is outdated. The assumption is that bulk should cover a longer time span of data - but this is not the case
Analyse if/how our new scoring influenced the ranking of the top 10 countries
Analysis of findability:
Country ranking (not done this year, but parked for next year) Ranking of countries
Findability Two approaches: checking how different users perceive findability; as well as URL analysis (see below)
How users perceive findability (some options how to assess this)
URL analysis Question and assumption
Approach
Interpretation & discussion
Submitter analysis (for @morchickit and @tlacoyodefrijol)
Comparison of findings between 2015 and 2016
See issue: https://waffle.io/okfn/opendatasurvey/cards/58de2fbc378dc331012bff20
[ ] User self-assessment: https://drive.google.com/drive/folders/0B5j55T4ZyssBQ09XQ1dWTGtibVE
@tlacoyodefrijol we need this for next week! Deadline is Tuesday
We need to write a final report to justify what we did to achieve the outlined goals.
Final report https://docs.google.com/document/d/1p38Y46tVxbP0TLFLAfv8qDdW2M1tqwiXR6SW-8kGJi4/edit#
Outtakes (contains relevant information): https://docs.google.com/document/d/1uNAln9QxLsOKoLv0SrU-OT2xfT1hXDQVD22V-YN15ss/edit
Folder with all relevant files here: https://drive.google.com/drive/folders/0B5j55T4ZyssBTlhsZEMzN2NJdU0
Raw data for "GODI RESULTS" section are here: https://docs.google.com/spreadsheets/d/17zmn4wgxPaJozY9EjETjoL4ZVsV_T4wu5EYBT4B5MHY/edit#gid=560413061