ncss-tech / soilReports

An R package that assists with the setup and operation of a collection of soil data summary, comparison, and evaluation reports. These reports are primarily used by USDA-NRCS soil scientists in both initial and update mapping.
15 stars 5 forks source link

"to check" polygon output #34

Closed brownag closed 7 years ago

brownag commented 7 years ago

Add an additional .SHP file to output that includes information regarding polygons with above-threshold proportion of sample values outside 5-95% percentile range for the MU.

Currently, statistics shapefile output contains median values for each raster and the "toCheck" flag which is a ranking based on the amount of samples outside range. This polygon is useful for symbolizing where problematic polygons occur, and also the distribution of abiotic factors (vis-à-vis the median) across a MU extent... but does not tell why a particular polygon is flagged. This can rarely be determined from the median values unless it is a very extreme case.

The new shapefile will follow same format as the stats shapefile (1 column per raster) only instead of medians, contains the "proportion of samples outside range" for each raster data source.

This will reduce the iterative process of looking up polygon IDs to see why they were flagged. Currently that information is only available in the tabular output at the end of the report HTML file.

In addition to reducing iteration between the report and the shapefile display in e.g. ArcMap this will allow symbolizing of MUs based on proportion outside range for INDIVIDUAL data sources rather than an aggregate of all data sources.

brownag commented 7 years ago

Should this shapefile include all polygons in the analysis? Or just those above the arbitrary 0.15 threshold?

I tend to think it should mirror the stats file exactly, regardless of the proportion outside range. I have found that 0.15 may be too stringent of a threshold in some cases. Though conceivably for some data sources (say, slope) you could want a smaller threshold? Just putting the proportions in without filtering would allow the user to symbolize the data however they want in GIS.

brownag commented 7 years ago

Issue resolved with commit f14f80d675d80a77e4bf4949bf2761c1587d17b5. Minor aesthetic changes made with commit fa9ce1c119925f38cb2d81540db7738c5a6e3a83.