Open CECSpecialistI opened 3 years ago
@cspayne Added lines for breaking $v$x$y$z when $2=fast. Please check
The mapping shows that we are adding an authorized access point to IRIs provided in $0's and $1's for places with the authorized access point being the combined values of $a and $g. I have two questions related to this:
I suggest that we decide whether to use $a and $g or only $a and be consistent, as well as decide how these values should be combined (separated by a space or by --). We should also be consistent in minting a nomen for an authorized access point when we have a source and using a string value as an access point (not authorized) with no source.
If we are in agreement about this, I can update the mapping and the code.
@cspayne:
@cspayne Hi Cypress, for $0's and $1,
The mapping shows that we are adding an authorized access point to IRIs provided in $0's and $1's for places with the authorized access point being the combined values of $a and $g. 1, If we have no qualifier subfields, then AP/AAP is relevant, and it is useful to add the RDA AAP statement to the subfield $1 IRI as subject, with the value as a string object, no nomen, because we already 'know' the source from the IRI. (Gordon suggested this in previous #Subject Heading discussion)
2, We just map $a.
Here's one example from gnd for Mexico City:
<marc:datafield tag="651" ind1=" " ind2="7">
<marc:subfield code="a">Mexiko</marc:subfield>
<marc:subfield code="g">Stadt</marc:subfield>
<marc:subfield code="2">gnd</marc:subfield>
</marc:datafield>
@cspayne: The example of subfield $g is bad data, and unfortunately it may be a hint of a larger problem. The value 'stadt' is German for 'city', so the intended place is 'Mexico City' or 'Mexico (city)'. In both cases these should be recorded as subfield $a: the MARC 21 manual explicitly states for subfield $a 'Parenthetical qualifying information is not separately subfield coded'. I am wondering if this is an import artefact rather than a simple cataloguer error, arising from the inclusion of DNB authority data in M21. But we don't need to know how it has happened because the result for the transform is the same: only use $a for the AAP, as the manual instructs.
In the case of this example, this will output Mexico the country instead of Mexico City, but it is not invalid because Mexico City the place is part of Mexico the place :-)
If we can determine that if source is 'gnd' then subfield $g is a placename qualifier, then the values can be concatenated, but I think we have run out of time to make this a safe decision. There is an alternative: concatenate subfields $a and $g, but output the value as an AP, not an AAP.
I haven't found any with subfield $g that aren't from gnd, but I found a few more examples from gnd:
<marc:datafield tag="651" ind1=" " ind2="7">
<marc:subfield code="a">Südafrika</marc:subfield>
<marc:subfield code="g">Kontinent</marc:subfield>
<marc:subfield code="2">gnd</marc:subfield>
</marc:datafield>
<marc:datafield tag="651" ind1=" " ind2="7">
<marc:subfield code="a">Griechenland</marc:subfield>
<marc:subfield code="g">Altertum</marc:subfield>
<marc:subfield code="2">gnd</marc:subfield>
</marc:datafield>
<marc:datafield tag="651" ind1=" " ind2="7">
<marc:subfield code="a">East End</marc:subfield>
<marc:subfield code="g">London</marc:subfield>
<marc:subfield code="2">gnd</marc:subfield>
</marc:datafield>
@AdamSchiff do you know if $g is commonly used as part of the access point for gnd? If Adam doesn't know then I agree with Gordon, we should just use $a at this point.
If you look at a GND authority, the $g is being used as the qualifier to a heading, and I think it's an integral part of the authorized access point. You should not ignore it, as without the qualifier, the meaning is often ambiguous.
For example, for East End of London, here's the GND authority: https://d-nb.info/gnd/4138902-5
[cid:a79b446c-47eb-43ce-b087-0d8d9f2fcdf6] You can see that the authorized access point is East End (London), and the variant access point is London- East End. If you look at it in MARC21 you can see the encoding does use $g for the qualifier (London):
<?xml version="1.0" encoding="UTF-8"?>
I looked for other GND authorities by searching in Wikidata, and it appears that $g is used regularly for qualifiers for city sections. For example for the Schöneberg neighborhood in Berlin (https://d-nb.info/gnd/4106612-1):
<?xml version="1.0" encoding="UTF-8"?>
The qualifier in $g is also used for celestial bodies (https://d-nb.info/gnd/4179170-8):
<?xml version="1.0" encoding="UTF-8"?>
Here's the GND record for Mississippi River, which in GND is established as Mississippi (Fluss) (https://d-nb.info/gnd/4039589-3https://d-nb.info/gnd/4039589-3😄
<?xml version="1.0" encoding="UTF-8"?>
In all cases that I've seen, what's in $g is rendered in a label as a qualifier in parentheses:
East End (London)
Schöneberg (Berlin)
Saturn (Planet)
Mississippi (Fluss)
If you drop these qualifiers, the term is no longer clear. Does "Saturn" refer to a planet, a deity, or an automobile model? Does "Mississippi" refer to a state or a river?
Adam
Adam L. Schiff Principal Cataloger University of Washington Libraries (206) 543-8409 @.***
From: Cypress Payne @.> Sent: Monday, November 11, 2024 11:54 AM To: uwlib-cams/MARC2RDA @.> Cc: Adam L Schiff @.>; Mention @.> Subject: Re: [uwlib-cams/MARC2RDA] 651 subject added entry--geographic name (Issue #246)
I haven't found any with subfield $g that aren't from gnd, but I found a few more examples from gnd:
<marc:datafield tag="651" ind1=" " ind2="7">
<marc:subfield code="a">Südafrika</marc:subfield>
<marc:subfield code="g">Kontinent</marc:subfield>
<marc:subfield code="2">gnd</marc:subfield>
</marc:datafield>
<marc:datafield tag="651" ind1=" " ind2="7">
<marc:subfield code="a">Griechenland</marc:subfield>
<marc:subfield code="g">Altertum</marc:subfield>
<marc:subfield code="2">gnd</marc:subfield>
</marc:datafield>
<marc:datafield tag="651" ind1=" " ind2="7">
<marc:subfield code="a">East End</marc:subfield>
<marc:subfield code="g">London</marc:subfield>
<marc:subfield code="2">gnd</marc:subfield>
</marc:datafield>
@AdamSchiffhttps://urldefense.com/v3/__https://github.com/AdamSchiff__;!!K-Hz7m0Vt54!kbYtb7EmvtuwH_i121vG7pyd6QelTDUt5x4C_Tsn0-gPNBjRROepx1r-ewJEZQeouJfeHUdlJTrqv4h1eCQ3vRM$ do you know if $g is commonly used as part of the access point for gnd? If Adam doesn't know then I agree with Gordon, we should just use $a at this point.
— Reply to this email directly, view it on GitHubhttps://urldefense.com/v3/__https://github.com/uwlib-cams/MARC2RDA/issues/246*issuecomment-2468920399__;Iw!!K-Hz7m0Vt54!kbYtb7EmvtuwH_i121vG7pyd6QelTDUt5x4C_Tsn0-gPNBjRROepx1r-ewJEZQeouJfeHUdlJTrqv4h1UBgZS_g$, or unsubscribehttps://urldefense.com/v3/__https://github.com/notifications/unsubscribe-auth/ADFBVBZFG5PXHQA3W2ZNID32AEDOPAVCNFSM6AAAAABPU77WZSVHI2DSMVQWIX3LMV43OSLTON2WKQ3PNVWWK3TUHMZDINRYHEZDAMZZHE__;!!K-Hz7m0Vt54!kbYtb7EmvtuwH_i121vG7pyd6QelTDUt5x4C_Tsn0-gPNBjRROepx1r-ewJEZQeouJfeHUdlJTrqv4h1bUPPPDY$. You are receiving this because you were mentioned.
@cspayne, @AdamSchiff: So it looks like a systematic import/matching error: The GND data should have been concatenated and the qualifiers put in parentheses before importing to 651 subfield $a. I assume the transform can rectify this by detecting the source as GND and concatenating subfield $a and subfield $g (with added parentheses to conform to authority norms).
Don't know. We should be consistent and mint a nomen if there is source so we can preserve the source information.
I also think we should mint a nomen if there is a source. As we discussed yesterday, we cannot determine whether an access point is authorized based on the $1 value. So if there is an approved source with $1 shouldn't we mint a nomen for that authorized access point to retain the source information? If there is no approved source and a $1, that is when we do not mint a nomen.
@cspayne: I think we resolved these cases via our slides?
@GordonDunsire yes! We did :)
https://github.com/uwlib-cams/MARC2RDA/blob/main/Working%20Documents/6XX.csv