Open lwasser opened 9 years ago
The current lesson 1 includes a discussion of bad data, and has the learners explore the data to see if there are any bad values. There are none in the data set being used.
Should this issue be closed? Or do we still want to add some bad data values before lesson release? My preference is not to change the underlying data at this point, as it will require some extra clean-up. Pedagogically, I do think it's a good idea to have the learners find and correct or remove bad data points. Maybe we can have this added after the first publication? I'm proposing a "status:wait" label.
yes. it would be easy to modify the data but there is no good solution. i think a bit of discussion about bad vs missing data and options toa ddress it (without actually fixing it) could also work. The key learning lesson here i think is just to
Some of the data has missing values, and this appears in the plots, for example: NEON-DS-Airborne-Remote-Sensing/HARV/DTM/HARV_DTMhill_WGS84.tif which is visualized in https://datacarpentry.org/r-raster-vector-geospatial/03-raster-reproject-in-r/index.html perhaps more could be done to understand this.
I like the idea. Does this mean adding a section to remove bad data values? Acknowledge issue in min/max section and then deal with it in bad data section?