ualbertalib / metadata

UAL metadata team's repository

Generate full MARCXML xpath reports for all IA collections #95

Closed sfarnel closed 7 years ago

sfarnel commented 8 years ago

Explore the MARCXML data for the IA collections and generate an XPath report (similar to what was done for the Peel MODS metadata) so that we can begin mapping for ingest into ERA.
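As a rough illustration of what such a report could contain, here is a minimal Python sketch that walks one MARCXML record and counts how often each populated path occurs. The function names (`xpath_counts`, `step`), the path notation, and the sample record are all assumptions made for this example; this is not the script or report layout used for the Peel MODS work.

```python
import xml.etree.ElementTree as ET
from collections import Counter


def local(tag):
    """Strip the '{namespace}' prefix that ElementTree adds to tag names."""
    return tag.split("}", 1)[-1]


def step(elem):
    """Build one path step, keeping the MARC tag or subfield code as a predicate."""
    name = local(elem.tag)
    if name in ("controlfield", "datafield"):
        return f"{name}[@tag='{elem.get('tag')}']"
    if name == "subfield":
        return f"{name}[@code='{elem.get('code')}']"
    return name


def xpath_counts(xml_text):
    """Count occurrences of every path that carries non-blank text."""
    counts = Counter()

    def walk(elem, prefix):
        path = f"{prefix}/{step(elem)}"
        if elem.text and elem.text.strip():
            counts[path] += 1
        for child in elem:
            walk(child, path)

    walk(ET.fromstring(xml_text), "")
    return counts


# Made-up sample record, for illustration only (not taken from an IA item).
SAMPLE = """<record xmlns="http://www.loc.gov/MARC21/slim">
  <leader>00000nam a2200000 a 4500</leader>
  <controlfield tag="001">demo-1</controlfield>
  <datafield tag="245" ind1="1" ind2="0">
    <subfield code="a">Sample title</subfield>
    <subfield code="c">Sample author</subfield>
  </datafield>
  <datafield tag="650" ind1=" " ind2="0">
    <subfield code="a">Topic one</subfield>
  </datafield>
  <datafield tag="650" ind1=" " ind2="0">
    <subfield code="a">Topic two</subfield>
  </datafield>
</record>"""

if __name__ == "__main__":
    for path, n in sorted(xpath_counts(SAMPLE).items()):
        print(n, path)
```

Running the same function over every record in a collection and summing the counters would show which fields and subfields are actually in use, which is the information needed to start a mapping.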

sfarnel commented 8 years ago

Homestead records and Technocracy are excluded (possibly going into AtoM), as is Nosotros (for Avalon).

anayram commented 8 years ago

@sfarnel @zschoenb

Sounds good; those collections have no MARC records (albertahomestead, ualberta_technocracy, nosotrostv).

@sfarnel I am assuming we can also exclude the following collections from the report; can you confirm?

  • albertapostcards - 5 items (no MARC)
  • cius_newsletters - 1 item (no MARC)

sfarnel commented 8 years ago

Yes, ignore the postcards.

Much of the CIUS material will have MARC records; some will not. Let's include what we can and generate reports for those that don't based on what metadata there is.

anayram commented 7 years ago

To do:

  • Extract files for items with no MARC XML
  • Break MarcEdit files
  • Generate reports

@sfarnel, for the report, do you want us to also merge unicorn ids from UAL's MARC XML? Or should we do the merging once we transform for migration? @zschoenb

sfarnel commented 7 years ago

I have no preference. If one is easier than the other, go with that.

anayram commented 7 years ago

We will have to do the merging for migration into ERA so I am leaving that item for later.

In the report we are including:

  1. IA object link (landing page). E.g. https://archive.org/details/xvixviiiukrainia50masl
  2. Full MARC record link. E.g. https://archive.org/download/xvixviiiukrainia50masl/xvixviiiukrainia50masl_marc.xml
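
Both links in the list above follow a predictable pattern based on the IA item identifier, so they can be generated rather than collected by hand. A minimal sketch, assuming the identifier is already known; the function name `ia_links` and the dictionary keys are hypothetical and chosen only for this example:

```python
def ia_links(identifier):
    """Return the landing-page and MARC XML URLs for one IA item identifier."""
    base = "https://archive.org"
    return {
        # Landing page, pattern taken from example 1 above.
        "landing_page": f"{base}/details/{identifier}",
        # Full MARC record, pattern taken from example 2 above.
        "marc_xml": f"{base}/download/{identifier}/{identifier}_marc.xml",
    }


if __name__ == "__main__":
    for label, url in ia_links("xvixviiiukrainia50masl").items():
        print(label, url)
```

Items that have no MARC record would still get a `marc_xml` URL from this function, so the link should be checked before it is written to the report.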

@sfarnel - do you see any benefit in including a link to the Open Library landing page as well? E.g. http://www.openlibrary.org/books/OL750528M for the same object as in the examples above.

sfarnel commented 7 years ago

This is great @anayram; thanks!

At this point I don't think we need worry about the Open Library link.

anayram commented 7 years ago

IA report files are available from:

  • Drive: https://drive.google.com/open?id=0B-wPdqHEduXtU2c1dElZNmxZOUU
  • GitHub: https://github.com/ualbertalib/metadata/tree/master/metadata-wrangling/internet_archive_coll/reports