Open mjy opened 7 months ago
Please clarify if fieldIdentifier
is actually fieldNumber
? ( https://dwc.tdwg.org/list/#dwc_fieldNumber, currently mapped as verbatim_trip_identifier )
eventID
is mapped as TripCode
identifier and the namespace is either assigned from non-standard TW:Namespace:eventID
field, or from the auto-generated per import dataset one (there was a bug with this one https://github.com/SpeciesFileGroup/taxonworks/commit/73b187deb60c1c4568904da90bfe7e40fe4a975b)
At present it is eventID
that serves as a way to detect that two events are the same, scoped either to a specific TW:Namespace:eventID
, or a import dataset-specific namespace. When a match is found, existing CE is used rather than creating a new one and merge data.
Proposal is new namespace, probably global (although would break stuff), to identify CEs?
Sorry for label confusion, yes fieldNumber
Making a note that TW:Namespace:eventID
matches against Namespace#short_name
Identifier::Local::Event
)
Identifier::Local::TripCode
for `Identifier::Local::FieldNumber'
Migrate all Identifier::Local::TripCode to FieldNumberAdd constraints per type:
verbatim_field_number
must equal cached
if corresponding Identifier::Local::FieldNumber
is also providedEnsure we handle DwC import for
Ensure DwcOccurrenceHooks are fired
Ensure DwcOccurrence Index is written
verbatim_trip_identifier
to dwc_field_number
Uniqueness/re-use
eventID
without TW:Namespace:eventID
is providedeventID
AND TW:Namespace:eventID
are provided
fieldNumber
fieldNumber
and eventID
provided then
fieldNumber
and verbatim_trip_identifer
fieldNumber
to verbatim_trip_identifier
verbatim_trip_identifier
instantiate a fieldNumber
eventID
eventID
without any Namespace will generate a new per-dataset namespace fieldNumber
fieldNumber
without any Namespace will Raiseverbatim_trip_identifier
TW:CollectingEvent:verbatim_trip_identifier
)catalogNumber
and recordNumber
catalogNumber
recordNumber
Thanks to @debpaul's testing we have some things that we should work to improve on. The basic issues are included in the image below.
@LordFlashmeow, @LocoDelAssembly please chime in after digesting this. In particular we need to re-vist how AntWeb is getting data in-out, as they have most extensively imported identifiers.
I do anticipate a couple things:
fieldNumber
.fieldNumber
as a ways to collate unique CEs, as we do foreventID
(we'd have to prioritize what to use if both present- something like useeventID
for uniquness and add multple TripCode identifiersIn general we also need to interpret
eventID
. To me it should reflect the digital record, i.e.eventID
is roughly 1 to 1 with a single TW CEThoughts?