USEPA / Phytoplankton-Data-Analysis

Phytoplankton Data Analysis

EDD* Files: Missing Sample ID names or components. #23

Open mjpdenver opened 10 years ago

mjpdenver commented 10 years ago

I need guidance in creating a sample ID from the following files.

The following files tend to be missing ClientSampleID, or the components needed to fill out the other fields:

"Drew data/a/(-EFR-)EDD_1204846_East_Kent_WS_v1.xls"
"Drew data/a/(EFR) EDD_1206519_East_Kent_WS_v2.xls"
"Drew data/a/(EFR) EDD_1207784_East_Kent_WS_v4.xls"
"Drew data/g/EDD_1208475_East_Kent_WS_v1.xls"
"Drew data/g/EDD_1208476_East_Kent_WS_v1.xls"

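One way to locate the affected rows before building sample IDs is to flag every record whose ClientSampleID is absent or blank. The following is a minimal Python sketch, not code from this repository. It assumes each spreadsheet row has already been read into a dictionary keyed by column name; the function name `rows_missing_sample_id` and the example records are hypothetical.

```python
def rows_missing_sample_id(records, field="ClientSampleID"):
    """Return the 0-based indices of records whose sample-ID field is absent or blank.

    `records` is assumed to be a list of dicts, one per spreadsheet row,
    keyed by column name. This layout is an assumption, not the project's.
    """
    missing = []
    for index, record in enumerate(records):
        value = record.get(field)
        # Treat None, empty strings, and whitespace-only strings as missing.
        if value is None or str(value).strip() == "":
            missing.append(index)
    return missing


if __name__ == "__main__":
    # Made-up records for illustration only; not taken from the EDD files.
    example = [
        {"ClientSampleID": "A-001"},
        {"ClientSampleID": "   "},
        {},
    ]
    print(rows_missing_sample_id(example))
```

The returned indices could then be reviewed by hand to decide whether a sample ID can be reconstructed from the other fields.
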
jbeaulie commented 10 years ago

I have no idea how to interpret most of the records in these files. There are at least a few rows in each file that are identifiable, however.

"Drew data/g/EDD_1208475_East_Kent_WS_v1.xls": Rows 7-20 are identifiable.

"Drew data/g/EDD_1208476_East_Kent_WS_v1.xls": Identical to "Drew data/g/EDD_1208475_East_Kent_WS_v1.xls".

"Drew data/a/(-EFR-)EDD_1204846_East_Kent_WS_v1.xls": Can get close on some of these, but no depth is reported.

"Drew data/a/(EFR) EDD_1206519_East_Kent_WS_v2.xls": Rows 3-20 are identifiable.

"Drew data/a/(EFR) EDD_1207784_East_Kent_WS_v4.xls": Rows 7-23 are identifiable.

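If only the identifiable rows were to be read, the row ranges listed above could be recorded in a lookup table and used to slice each sheet. This is a hedged Python sketch, not the project's code. It assumes the row numbers are 1-based spreadsheet rows with inclusive ranges, which the comment does not confirm; the names `IDENTIFIABLE_ROWS` and `keep_identifiable` are hypothetical.

```python
# Inclusive (first_row, last_row) ranges taken from the comment above.
# Assumption: these are 1-based spreadsheet row numbers.
# EDD_1208476 is omitted because it is reported as identical to EDD_1208475;
# EDD_1204846 is omitted because no row range was given for it.
IDENTIFIABLE_ROWS = {
    "Drew data/g/EDD_1208475_East_Kent_WS_v1.xls": (7, 20),
    "Drew data/a/(EFR) EDD_1206519_East_Kent_WS_v2.xls": (3, 20),
    "Drew data/a/(EFR) EDD_1207784_East_Kent_WS_v4.xls": (7, 23),
}


def keep_identifiable(path, rows):
    """Return only the rows of `path` that fall in its identifiable range.

    `rows` is a list of sheet rows in order, where rows[0] is spreadsheet row 1.
    Files without a known range yield an empty list.
    """
    if path not in IDENTIFIABLE_ROWS:
        return []
    first, last = IDENTIFIABLE_ROWS[path]
    # Convert the 1-based inclusive range to a 0-based Python slice.
    return rows[first - 1:last]
```

Leaving the duplicate file out of the table avoids counting the same records twice during aggregation.
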
mjpdenver commented 10 years ago

I recommend not reading data from these files until a version in which all rows are readable is available, because illegible records could cause problems when aggregating.