SAA-SDT / eac-cpf-schema

https://eac.staatsbibliothek-berlin.de/

<otherRecordId>: definition #54

Closed SJagodzinski closed 4 years ago

SJagodzinski commented 6 years ago

(Re)define <otherRecordId> in context of the (re)definition of <recordId> (#53) and <entityId> (#55)

Creator of issue

  1. Silke Jagodzinski
  2. TS-EAS: EAC-CPF subgroup
  3. s.jagodzinski@bundesarchiv.de or silkejagodzinski@gmail.com

The issue relates to

Wanted change/feature

(Re)define <otherRecordId> in context of the (re)definition of <recordId> and <entityId>, if necessary

Suggested Solution

Widen the scope for this element (cf. usages of this element in projects like SNAC, ARDC Party Infrastructure, etc.)

Two different uses for this element:

Schema: create an appropriate attribute or element for the provenance (source repository/system) of the <otherRecordId>, i.e. an element or attribute that allows the identifier of the institution to be included.

Tag Library: the connection between this record and another record that describes the same entity in another system is not reflected.

Context

Derived from #25. Issue #25 is split into 3 issues and is closed.

Tag Library:

Alternative record identifiers that may be associated with the EAC-CPF instance.

Description and Usage: An element in <control> used to encode record identifiers that are alternatives to the mandatory identifier in <recordId>. These might include the identifiers of merged EAC-CPF instances representing the same CPF entity or those of records that are no longer current but had some part in the history and maintenance of the EAC-CPF instance. The localType attribute can be used to identify the kind of institution or service responsible for each associated record identifier if not the same as that given in the <maintenanceAgency> element for this EAC-CPF instance.

Comments APEx, May 2013: <otherRecordId>: The @localType attribute of <otherRecordId> can be used to identify the kind of institution or service responsible for each associated record identifier. The @xml:id attribute of <otherRecordId> serves to name specific elements in an EAC-CPF instance. Question: How to identify the archival institution that has created an alternative record?

SJagodzinski commented 6 years ago

The Archives Portal Europe is using the element <otherRecordId> in the following way:

When existing EAC-CPF instances are delivered to the Archives Portal Europe and converted to apeEAC-CPF, the conversion ensures that the original <recordId> from the content provider's system is kept by transforming it to <otherRecordId> with localType="converted". As <otherRecordId> is repeatable, existing <otherRecordId> elements from the original files can be included in parallel, which might be useful for redirecting to content providers' systems.

For files exported from local systems to the Archives Portal Europe, the <recordId> in these exports is checked for being unique and for complying with the identifier format chosen for the Archives Portal Europe. When both requirements are met, the identifier as delivered in <recordId> is kept. When either requirement fails, the same principle applies as for converted files: the original <recordId> is moved to <otherRecordId> and a new identifier for use in the Archives Portal Europe is created as the content of <recordId>.

See: State of the art report on EAC-CPF and recommendations for implementation in Archives Portal Europe, chap 4.1.2, p 54.
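The rule described above for exported files could be sketched roughly as follows. This is only an illustration, not the Archives Portal Europe implementation: the identifier pattern `APE_ID_PATTERN`, the function `normalise_record_id`, the `mint_id` callable, and the sample value `local-123` are all hypothetical, and the EAC-CPF namespace is left out for brevity.

```python
import re
import xml.etree.ElementTree as ET

# Hypothetical identifier pattern; the real Archives Portal Europe
# format is not given in this thread.
APE_ID_PATTERN = re.compile(r"^ape-\d+$")


def normalise_record_id(control, existing_ids, mint_id):
    """Apply the keep-or-move rule to a <control> element.

    control      -- ElementTree element for <control> (no namespace assumed)
    existing_ids -- set of identifiers already used in the target system
    mint_id      -- callable returning a fresh identifier (hypothetical)
    """
    record_id = control.find("recordId")
    original = (record_id.text or "").strip()

    is_unique = original not in existing_ids
    is_compliant = bool(APE_ID_PATTERN.match(original))
    if is_unique and is_compliant:
        existing_ids.add(original)
        return control  # the delivered identifier is kept unchanged

    # Otherwise the original value moves to a new <otherRecordId>,
    # placed directly after <recordId> ...
    other = ET.Element("otherRecordId", {"localType": "converted"})
    other.text = original
    control.insert(list(control).index(record_id) + 1, other)

    # ... and <recordId> receives a newly created identifier.
    new_id = mint_id()
    record_id.text = new_id
    existing_ids.add(new_id)
    return control


# Usage: a non-compliant local identifier is moved, a new one is minted.
control = ET.fromstring("<control><recordId>local-123</recordId></control>")
normalise_record_id(control, set(), lambda: "ape-1")
print(ET.tostring(control, encoding="unicode"))
```

Because <otherRecordId> is repeatable, any <otherRecordId> elements already present in the delivered file are left untouched by this sketch.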

Examples for encoding of <otherRecordId> in APE:

gerhardmueller commented 6 years ago

To add to @SJagodzinski: for the Kalliope Union Catalog we have chosen a similar approach: <otherRecordId localType="DE-588">(DE-588)11850391X</otherRecordId>, where localType is an identifier or name of the data provider/provenance. In this case, localType="DE-588" is the ISIL of the Integrated Authority File (ex.: http://kalliope-verbund.info/de/xmleac?eac.id=11850391X).
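A consumer of such records might want to separate the provenance prefix from the identifier itself. The sketch below assumes, based only on the example value above, that the content has the shape `(PREFIX)identifier`; the function name `split_prefixed_id` is hypothetical and not part of any Kalliope tooling.

```python
import re

# Assumed shape: "(PREFIX)identifier", as in "(DE-588)11850391X".
PREFIXED_ID = re.compile(r"^\((?P<prefix>[^)]+)\)(?P<local>.+)$")


def split_prefixed_id(value):
    """Return (prefix, local identifier), or (None, value) if no prefix is found."""
    match = PREFIXED_ID.match(value.strip())
    if match is None:
        return None, value.strip()
    return match.group("prefix"), match.group("local")


print(split_prefixed_id("(DE-588)11850391X"))
```

Values without a parenthesised prefix are returned unchanged, so the same function can be applied to every <otherRecordId> regardless of its localType.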

fordmadox commented 6 years ago

I checked the Connecting the Dots project, and the only examples of otherRecordId in that project mirror the examples in the EAD3 tag library (as well as one of the examples of how otherRecordId is used in the Kalliope Union Catalog).

Here's one example, for the record about "Burney, Charles, 1726-1814":

<recordId>FHCL.HOUGH:8822109</recordId>
<otherRecordId localType="url">http://nrs.harvard.edu/urn-3:FHCL.HOUGH:8822109</otherRecordId>

The record also lists the LCNAF record as a "source," where the other record identifier URI can be accessed via the href attribute:

`<source xlink:actuate="onRequest" xlink:href="http://lccn.loc.gov/n50032067" xlink:show="new" xlink:type="simple">Library of Congress Control Number: n 50032067</source>`

Were the LC authority record not used as a source, though, then I'm not sure how that project would have decided to record external record identifiers. entityId (or identityId) could be the best place (if it's agreed that the authority record ID used by systems like VIAF represents the entity and not the authority record???), but right now there is not a good way to record a URI on any of those elements (entityId, recordId, or otherRecordId). It can be done, as the examples prove, but I'd prefer that the schema enforce the use of attributes for storing URIs.

Last, were this record ingested into SNAC in order to enhance http://snaccooperative.org/view/75601210, then I think that's where more discussion would need to happen. How is this handled in APE, Kalliope, etc.? Not only is the record ID worth considering, but even more importantly, how would you handle the statements listed, how they're cited, etc.? But citations and such are another matter altogether, albeit a crucial one for the major revision, I think.

gerhardmueller commented 6 years ago

To add to my previous comment and Mark's comments:

For the Kalliope Union Catalog, we deal with various identifiers from different systems, especially from authority records. However, in this context there is, at least from my current understanding, no difference between a local record and an authority record describing an entity.

<recordId>300755-8</recordId> (ex.: Suhrkamp Verlag, http://kalliope-verbund.info/gnd/300755-8)

This identifier is the primary identifier to identify the record describing the entity Suhrkamp Verlag in the Kalliope Union Catalog.

There are, however, additional identifiers from different systems:

<otherRecordId localType="PPN">003007553</otherRecordId>
<otherRecordId localType="DE-611_URI">http://kalliope-verbund.info/gnd/300755-8</otherRecordId>
<otherRecordId localType="DE-588_URI">http://d-nb.info/gnd/300755-8</otherRecordId>
<otherRecordId localType="DE-588">(DE-588)300755-8</otherRecordId>
<otherRecordId localType="DE-101">(DE-101)003007553</otherRecordId>
<otherRecordId localType="VIAF">150718921</otherRecordId>
<otherRecordId localType="WikiData">Q301609</otherRecordId>
<otherRecordId localType="LCAuth">n86825104</otherRecordId>
<otherRecordId localType="LCAuth_URI">http://id.loc.gov/authorities/names/n86825104</otherRecordId>
<otherRecordId localType="GNDinvalidId">(DE-588)1086213432</otherRecordId>
<otherRecordId localType="GNDinvalidId">(DE-588)1086110498</otherRecordId>
<otherRecordId localType="GNDinvalidId">(DE-588)1072892618</otherRecordId>
<otherRecordId localType="GNDinvalidId">(DE-588)108523696X</otherRecordId>
<otherRecordId localType="GNDinvalidId">(DE-588)4058533-5</otherRecordId>
<otherRecordId localType="SWDInvalidId">(DE-588c)4058533-5</otherRecordId>
<otherRecordId localType="GKDInvalidId">(DE-588b)300755-8</otherRecordId>

The element otherRecordId is used both for additional identifiers assigned by database systems of the Kalliope Union Catalog and for identifiers from external systems providing records about the same entity. At least so far, I have not considered record identifiers from library or archival systems to be identifiers assigned to an entity, which would indeed be encoded with entityId. As far as I understood the TL definition: "identifiers that are alternatives ... These might include the identifiers of merged EAC-CPF instances representing the same CPF entity or those of records that are no longer current but had some part in the history and maintenance of the EAC-CPF instance." I simply replaced "EAC-CPF instances" with "records describing a CPF entity in any system", since I consider EAC-CPF an encoding schema mainly for data exchange purposes.

Identifiers I would consider encoding with entityId are any identifiers "used to designate the entity" and "typically assigned by an authoritative agency", which would be, again at least from my current perspective, any real-world corporate body, e.g. a tax office assigning a tax ID, a district court assigning a company a commercial register ID (in Germany, e.g. Suhrkamp HRB 163774 B), the VAT registration number (USt-ID), etc.
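To make the role of localType in this list concrete, here is a minimal sketch that groups <otherRecordId> values by their localType. It uses a subset of the identifiers listed above wrapped in a <control> element without a namespace; the function `other_ids_by_type` is hypothetical, and treating a localType that ends in "invalidId" as a marker for superseded identifiers is an assumption drawn from the naming in the list, not a documented Kalliope rule.

```python
import xml.etree.ElementTree as ET
from collections import defaultdict

# A subset of the identifiers listed above, wrapped in <control> for parsing.
SAMPLE = """
<control>
  <recordId>300755-8</recordId>
  <otherRecordId localType="PPN">003007553</otherRecordId>
  <otherRecordId localType="DE-588">(DE-588)300755-8</otherRecordId>
  <otherRecordId localType="VIAF">150718921</otherRecordId>
  <otherRecordId localType="GNDinvalidId">(DE-588)1086213432</otherRecordId>
  <otherRecordId localType="GNDinvalidId">(DE-588)1086110498</otherRecordId>
</control>
"""


def other_ids_by_type(control):
    """Map each localType value to the list of identifiers carrying it."""
    grouped = defaultdict(list)
    for element in control.findall("otherRecordId"):
        grouped[element.get("localType", "")].append((element.text or "").strip())
    return dict(grouped)


ids = other_ids_by_type(ET.fromstring(SAMPLE))

# Assumption: a localType ending in "invalidId" (any case) marks a
# superseded identifier; everything else is treated as current.
current = {k: v for k, v in ids.items() if not k.lower().endswith("invalidid")}
print(current)
```

Since localType is free text, any such grouping depends entirely on the conventions of the data provider, which is part of what this issue is about.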

<entityId localType="HRB">163774 B</entityId>
<entityType>corporateBody</entityType>
<nameEntry>
    <part>Suhrkamp Verlag</part>
</nameEntry>

Last but not least, so far I have understood the element source to name the sources used to describe a CPF entity, like archival records, biographies, lexica or also records linked via otherRecordId (VIAF, GND, ...).

It would be, however, a very good idea to go on with the discussion on this and in addition to that - and as already addressed yesterday in our call - if the elements and attributes are sufficient, e.g.

SJagodzinski commented 5 years ago

Gerhard pointed out that SNAC also uses the element <otherRecordId> for non-EAC-CPF records, e.g. Hannah Arendt.

Kalliope uses <recordId> for its own ID, which could also be an authority file identifier (in case Kalliope creates this authority file as partner of the authority file agency). If so, the same ID is repeated in <otherRecordId> and specified with the @localType attribute, e.g. Hannah Arendt. Again, <otherRecordId> contains a non-EAC-CPF record ID.

Hence, SNAC and Kalliope seem to use <otherRecordId> for any (record) identifier of a third system, specified by @localType and independently of the record's format, i.e. EAC-CPF record or not.