Open jdunic opened 1 year ago
This is NOT for the IPHC survey, correct? (Then, sorry, I'm not sure I can help).
I know I had similar issues for IPHC survey (and maybe for data that was in gfbio, not just in gfiphc). Is this only happening occasionally? I'm guessing not, but sometimes a line breaks or something, and not all the hooks get fished. Someone dropped their recording sheet over the side of the boat once, and it took me half a day to track that problem down.
Aha - `total_hooks` might be total hooks deployed? And hence always <= `hook_count`.
Correct, I only checked this difference for HBLL INS and specifically only compared information using Yelloweye.
Interestingly... `total_hooks` is NOT always <= `hook_count`... And the `total_hooks` calculation subtracts out bent/broken hooks. But I hadn't thought about those other reasons for missing hooks 😆 that is helpful.
There seems to be a discrepancy in the data coming from `gfdata::get_survey_sets()` and `gfdata::get_ll_hook_data()`. From `get_ll_hook_data()`, it looks like `count_target_species` and `count_non_target_species` are not always correct. And I am unclear where the bait counts come from for the `fishing_event_id` values that correspond with these incongruent values.

Setup data to combine hook counts (baited, empty, total) with survey data
Expected results
1. Number of rows from `get_survey_sets()` and `get_ll_hook_data()` should be equal
2. `total_hooks` should equal `hook_count`
3. `count_target_species` should equal `catch_count`
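
The three expectations above can be checked with something like the following. This is only a minimal sketch in base R: the two data frames are small made-up stand-ins for the real `get_survey_sets()` and `get_ll_hook_data()` outputs, the column names are the ones used in this issue, and all values (including the `fishing_event_id`s) are illustrative rather than real survey data.

```r
# Toy stand-ins for the two query results; every value here is made up.
survey_sets <- data.frame(
  fishing_event_id = c(1, 2, 3),
  hook_count = c(225, 225, 220),
  catch_count = c(16, 18, 0)
)
hook_data <- data.frame(
  fishing_event_id = c(1, 2),
  total_hooks = c(225, 210),
  count_target_species = c(15, 18)
)

# Expectation 1: row counts match; also list events found in only one source.
same_n_rows <- nrow(survey_sets) == nrow(hook_data)
only_in_sets <- setdiff(survey_sets$fishing_event_id, hook_data$fishing_event_id)

# Expectations 2 and 3: join on fishing_event_id and flag disagreements.
both <- merge(survey_sets, hook_data, by = "fishing_event_id")
both$hooks_differ <- both$total_hooks != both$hook_count
both$catch_differs <- both$count_target_species != both$catch_count
mismatches <- both[both$hooks_differ | both$catch_differs, ]
```

With the real outputs swapped in for the toy data frames, `mismatches` would hold the `fishing_event_id` values worth inspecting by hand, and `only_in_sets` the sets that one function keeps and the other drops.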
Current results
1. A different number of rows is returned by `get_ll_hook_data()` than `get_survey_sets()`. From what I can tell, I think that this is because of a difference in checking for usable sets in `get_ll_hook_data`.
2. `total_hooks` does not equal `hook_count` (from `get_survey_sets()`) in some cases. `hook_count` looks like the correct value.
3. `catch_count`, which comes from `get_survey_sets()`, looks like the correct value, while catch information from `get_ll_hook_data()` looks wrong.

Some examples of 2 and 3:
- `fishing_event_id == 2193813`: count of target species (`count_non_target_species`) is 0, but should be 3
- `fishing_event_id == 309482`: count of target species is 15 but should be 16
- `fishing_event_id == 1722030`: count of target species is 19 but should be 18
- `fishing_event_id == 1722028`: count of target species is 2 but should be 1
- `fishing_event_id == 4590043`: count of non_target is 0, but 141 dogfish were caught, which looks like why `total_hooks` is incorrect

4. When confirming that the bait and empty counts are correct, I cannot find the `fishing_event_id`s from the examples above in `HOOK_SPECS`, so where is the `count_bait_only` coming from in `get_ll_hook_data()`? I might just be confused here. How do the other values in the `FISHING_EVENT` table come into play?