USEPA / Phytoplankton-Data-Analysis

Phytoplankton Data Analysis
3 stars 0 forks source link

taxa = ND #33

Closed jbeaulie closed 10 years ago

jbeaulie commented 10 years ago

In sheet_id 1063 (HAB 2EFR 2013 07 02 (2).xlsx) no blue-green algae were detected at several sites. The taxa was reported as 'ND' and the cell_per_l field was left blank. This should be coded as described in issue #3 . Taxa as NA, the Class as Blue-Green, the Cell.L as 0, and the BV.um3.L as 0.

mjpdenver commented 10 years ago

I corrected this - but note that throughout the dataset, the class field is largely unpopulated. Presumably this will be filled in by merging with a master-tax list. I am wary of using NA as a character string because someone ( namely me in the future) will think it was a missing field recoded as string. I have coded this taxa as NBF = no bacteria found.

mjpdenver commented 10 years ago

I notices about 37 rows with no taxa listed. An example is in drew/k/92RAWDAT.xls sheet EFR

screen shot 2014-05-01 at 9 14 36 am

jbeaulie commented 10 years ago

-Only 37 observations, that is good. See Issue #27 for discussion of these data. I think they should be omitted. -The NBF designation sounds good, though NAF may be better (no algae found). -I am fine with leaving the class field blank for these observations, as long as hab=T. At the back end we'll do something like: algae[algae$hab == TRUE, "class"] = "Blue-green"

mjpdenver commented 10 years ago

Relabeled NAF in readHad.R script. Records with no taxa reported dropped in the algaeCheck.R script.