Closed brownag closed 7 years ago
Should this shapefile include all polygons in the analysis? Or just those above the arbitrary 0.15 threshold?
I tend to think it should mirror the stats file exactly, regardless of the proportion outside range. I have found that 0.15 may be too stringent of a threshold in some cases. Though conceivably for some data sources (say, slope) you could want a smaller threshold? Just putting the proportions in without filtering would allow the user to symbolize the data however they want in GIS.
Issue resolved with commit f14f80d675d80a77e4bf4949bf2761c1587d17b5. Minor aesthetic changes made with commit fa9ce1c119925f38cb2d81540db7738c5a6e3a83.
Add an additional .SHP file to output that includes information regarding polygons with above-threshold proportion of sample values outside 5-95% percentile range for the MU.
Currently, statistics shapefile output contains median values for each raster and the "toCheck" flag which is a ranking based on the amount of samples outside range. This polygon is useful for symbolizing where problematic polygons occur, and also the distribution of abiotic factors (vis-à-vis the median) across a MU extent... but does not tell why a particular polygon is flagged. This can rarely be determined from the median values unless it is a very extreme case.
The new shapefile will follow same format as the stats shapefile (1 column per raster) only instead of medians, contains the "proportion of samples outside range" for each raster data source.
This will reduce the iterative process of looking up polygon IDs to see why they were flagged. Currently that information is only available in the tabular output at the end of the report HTML file.
In addition to reducing iteration between the report and the shapefile display in e.g. ArcMap this will allow symbolizing of MUs based on proportion outside range for INDIVIDUAL data sources rather than an aggregate of all data sources.