colouring-cities / colouring-britain

Developed out of the Colouring London prototype. Collecting data on Britain's buildings and testing new core features
https://colouringbritain.org/
GNU General Public License v3.0
10 stars 2 forks source link

DATA ACCURACY/RELIABILITY: Features and discussion #96

Open polly64 opened 2 years ago

polly64 commented 2 years ago

1.Verification Button

2.Edit History

3.Source type dropdown

4.Source link

5.Uncertainty - Question phrasing

6.Specifying method of data capture

7.Cross checking/feedback loops

8.Data Accuracy text information

Notes: To improve data accuracy and quality/reduce malicious inaccurate input a) bulk uploads are moderated/recommended by academic partners, at regional level, with CCRP international research partner final decision on release b) manual entries can only be done building by building- This can be speeded up using the copy and paste tool. We have specifically chosen not to use the ArcGIS highlight large area and paste style option to prevent malicious behaviour and to make it as boring as possible for people to trash data. c) the following are included to increase reliability of data name of editor/edit history and date (we need to make last entry more visible) type of source source link d) the verification buttons tells you how many other users agree with date e) We are building a network of specialist users (see CLHEAG) to check and enrich data. Local planning groups. and local civic societies are set up specifically to oversee change in local areas. It is therefore in their interest to verify and monitor data as well as to enrich f) for age we are looking to cross check data generated using a number of methods these include: upload from unknown user upload from known expert group upload using historical street network inference upload using UK gov energy performance certificate data upload (if ever released by uk gov) of property tax age data g) we might have a feature where we allow all dates ever entered for a building to be viewed at once and link to editor name i) we may include image of facade but would need to do this in a way to keep storage light and also to not link to commercial products- e.g. googlestreetview where they could just change terms and conditions at any stage ( as they have for analysing the streetview data) k) we will probably include typology dropdown diagrams in the 'Type section' so this will also act as an additional verification l) we are interested in feedback loops between the automated processes and manual checking and how not to override the specialist input. We are trying to move towards a system which allows you to download say age data and asses the reliability yourself using all the above info. m) we preventing people deleting and then saving blank edit box. We need to change this so you can only delete if you enter an alternative date.

Statement on data accuracy 'The data are provided "as is", without warranty of any kind, express or implied, including but not limited to the warranties of merchantability, accuracy, fitness for a particular purpose and non-infringement. In no event shall ... add academic Colouring Cities host name... be liable for any reliance that you place on or how you use the data nor any claim, damages or other liability, whether in an action of contract, tort or otherwise, arising from, out of or in connection with the data or the use or other dealings in the data. As Colouring London data are crowdsourced from multiple sources and may contain errors, your help in adding sources and in verifying data entries is greatly appreciated. Visible edit histories, and phrasing of questions are also used to help you assess the accuracy and reliability of data and its suitability for your intended use, be this an academic paper, a school project or a government policy document. We will also be introducing other measures including icons to to indicate the way in which the data have been captured, for example through bulk upload from a monitored source, crowdsourcing at building level, computational generation, or live streaming. If you have suggestions for additional ways to improve data accuracy features please do comment at....'

polly64 commented 1 year ago

@mattnkm @h-petersen shall we start working here on verification tools for Australia? (as @mdsimpson42 will be moving Colouring London issues to core code repository)? You'll see code available already relating to the above which you can apply immediately an stuff we haven't done yet but want to. Matt what was that automated process you thought of re cross checking data? See also colouring-cities/colouring-core#882

polly64 commented 8 months ago

@polly64 to cjheck