earthlab-education / Earth-Analytics-AY24

Class repository for the Earth Analytics Professional Certificate program AY 2024
https://earthlab-education.github.io/Earth-Analytics-AY24/
MIT License
0 stars 0 forks source link

brglea [READING RESPONSE] Crowd-sourcing and Open Street Maps #122

Closed github-actions[bot] closed 2 months ago

github-actions[bot] commented 2 months ago

Check out this weeks reading discussion https://github.com/earthlab-education/Earth-Analytics-AY24/discussions/41

Originally posted by **eculler** August 27, 2024 Before class on Week 2 (September 4/5), read the following short articles, respond to them in this discussion thread, and prepare to discuss them in class: - [Revolutionizing the map](https://scitechdaily.com/revolutionizing-the-map-how-smartphones-and-crowdsourcing-are-redefining-geospatial-data/). You can also check out the original [review article](https://spj.science.org/doi/10.34133/remotesensing.0105) - [Open Street Maps -- How to contribute](https://wiki.openstreetmap.org/wiki/How_to_contribute) As a guide for your response, you can consider: - What are some of the benefits or drawbacks to crowd-sourced data? - As a data scientist, what are some things you should consider or account for when using crowd-sourced data? - How does crowd-sourced data relate to open science? How does it contribute to and/or complicate open science efforts?
brglea commented 2 months ago

Crowd sourced data has drastically changed the data landscape in many beneficial ways; crowd sourced data provides real time, community driven data that ordinary people can contribute to (which is not the case with authoritative data sources). While this democratization of data and diversified types of data available has led to richer multifaceted insights and shifts across industry types, this comes with drawbacks.

As a data scientist it is important to account for the potential drawbacks when using crowd sourced data. Data quality and accuracy, data privacy, potential lack of sustainable data, legal and ethical issues, data biases, and data interpretation are all things to consider. There are possible ways to try to account for these considerations such as: assessing and mitigating data biases, executing quality assessments, and having stringent agreements with third parties/ clear explicit consent with volunteers. It is essential to know your source to properly account for the corresponding drawbacks.

Crowd sourced data, like ‘open science’, is aimed to be open to everyone to contribute to or access regardless of expertise, demographics, etc. Crowd sourced data can add potential new insights to open science through the expanded data sources; however, there would likely be difficulty in being able to reproduce or replicate crowdsourced data which would complicate open science efforts of replication or reproducibility.

Although the challenges of crowd sourced data create complexity to using it, crowd sourced data’s potential should not be underscored.