scientist-softserv / oral-history

UCLA LIBRARY-CENTER FOR ORAL HISTORY RESEARCH --Documenting the histories of Los Angeles-- The UCLA Library creates a vibrant nexus of ideas, collections, expertise, and spaces in which users illuminate solutions for local and global challenges. We constantly evolve to advance UCLA’s research, education, and public service mission by empowering and
https://oralhistory.library.ucla.edu/

No results show up in full text search #66 #10

Closed labradford closed 7 months ago

labradford commented 1 year ago

When entering a search term using "full text search" no results are retrieved (example: jazz, hollywood)

However, results are retrieved when using "interview information."

See screenshot.

Image

labradford commented 1 year ago

Lea Ann Bradford April 2022

When we are trying to index the transcript, we are getting an empty string. For example, the transcript url in the feed is http://digital2.library.ucla.edu/dlcontent/oralhistory/text/masters/21198-zz00293fc4-2-master.xml but that doesn’t seem to exist.

labradford commented 1 year ago

Lea Ann Bradford April 2022

Slack thread:

Lea Ann Bradford (she/her):cherry_blossom: 6 days ago I think I may have figured it out. The full text search only returns search matches for the transcripts. When we are trying to index the transcript, we are getting an empty string. For example, the transcript url in the feed is http://digital2.library.ucla.edu/dlcontent/oralhistory/text/masters/21198-zz00293fc4-2-master.xml but that doesn’t seem to exist. Was there a change to that server or something?

Lea Ann Bradford (she/her):cherry_blossom: 6 days ago @t-kay

T-Kay Sangwand 6 days ago @Kristian Allen - UCLA can you respond?

Kristian Allen 3 days ago Thanks @leaannbradford, sorry for the delay on this; for some reason I didn't see it until now. The URL did change. Can you confirm the field you are using? I ask because the rest of the transcript and text-related downloads seem to be picking up the correct URL, so I'm trying to narrow down where this is coming from. For example, see the links under downloads on https://oralhistory.library.ucla.edu/catalog/21198-zz00096t1v?counter=1 They should all be of the format static.library.ucla.edu/oralhistory...

Lea Ann Bradford (she/her):cherry_blossom: 2 days ago We are getting that info from this field: mods:location/mods:url[@usage="timed log"] I don’t have access to the http://static.library.ucla.edu/oralhistory link so I wasn’t able to see the format

Lea Ann Bradford (she/her):cherry_blossom: 2 days ago This is the url for our OAI feed https://webservices.library.ucla.edu/dldataprovider/oai2_0.do

Lea Ann Bradford (she/her):cherry_blossom: 2 days ago This is an example of one. You can see that it doesn’t seem to exist. http://digital2.library.ucla.edu/dlcontent/oralhistory/text/masters/21198-zz00293fc4-2-master.xml


Kristian Allen 2 days ago Thanks @leaannbradford, that is the correct URL for the OAI feed. The assets should be in the static domain now, i.e. for the example above: https://static.library.ucla.edu/oralhistory/text/masters/21198-zz00293fc4-2-master.xml I'll also confirm that mods:location/mods:url[@usage="timed log"] is providing the correct URL
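To illustrate the URL change described above, here is a minimal Ruby sketch that maps the legacy transcript URL onto the static domain. The helper name `to_static_url` and the constants are hypothetical, and the mapping is inferred only from the two example URLs in this thread; it is not code from the application, and the eventual fix was made in the OAI provider rather than in the indexing code.

```ruby
require "uri"

# Hypothetical constants, inferred from the two example URLs in this thread.
LEGACY_HOST   = "digital2.library.ucla.edu"
LEGACY_PREFIX = "/dlcontent/oralhistory/"
STATIC_HOST   = "static.library.ucla.edu"
STATIC_PREFIX = "/oralhistory/"

# Returns the static-domain equivalent of a legacy transcript URL.
# URLs that do not match the legacy host and path prefix are returned unchanged.
def to_static_url(url)
  uri = URI.parse(url)
  return url unless uri.host == LEGACY_HOST && uri.path.start_with?(LEGACY_PREFIX)

  new_path = uri.path.sub(LEGACY_PREFIX, STATIC_PREFIX)
  URI::HTTPS.build(host: STATIC_HOST, path: new_path).to_s
end

legacy = "http://digital2.library.ucla.edu/dlcontent/oralhistory/text/masters/21198-zz00293fc4-2-master.xml"
puts to_static_url(legacy)
```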

Kristian Allen 2 days ago so in the OAI feed there is https://webservices-test.library.ucla.edu/dldataprovider/oai2_0.do?verb=GetRecord&identifier=[…]la.edu:digital2/21198-zz00293fc4&metadataPrefix=mods

Kristian Allen 2 days ago https://static.library.ucla.edu/oralhistory/text/masters/21198-zz00293fc4-2-master.xml

Kristian Allen 2 days ago @leaannbradford can you confirm the record you are seeing the 'timed log' value in? I'm trying to locate that

Lea Ann Bradford (she/her):cherry_blossom: 2 days ago https://oralhistory.library.ucla.edu/catalog/21198-zz00293f2g

Kristian Allen 1 day ago this is the mods record for that entry https://webservices-test.library.ucla.edu/dldataprovider/oai2_0.do?verb=GetRecord&identifier=[…]la.edu:digital2/21198-zz00293f2g&metadataPrefix=mods The timed log in this case are pointed to wowza, I think this must be incorrect? I don't think we are indexing wowza streaming entries

Lea Ann Bradford (she/her):cherry_blossom: 1 day ago I’m not sure how it works, but it grabs the text from that URL and this is the value "http://digital2.library.ucla.edu/dlcontent/oralhistory/text/masters/21198-zz00293fc4-2-master.xml" Then it uses an XSLT file to convert it

Lea Ann Bradford (she/her):cherry_blossom: 1 day ago This is the code. It has been unchanged since 3/2019

```
time_log_url = child.elements['mods:location/mods:url[@usage="timed log"]'].text
transcript = self.generate_transcript(time_log_url)
```

> Kristian Allen  1 day ago
Thanks @leaannbradford, I think we (UCLA) must be putting the incorrect link in this field; I'll verify an example. I do see the correct domain (static), but it seems to be pointing to wowza links, which would explain why it is not being indexed

> Kristian Allen  8 minutes ago
Giving an update on this, I fixed our field to match the correct output and am deploying an updated OAI provider. No changes are required from Notch8
I'll reharvest on our end, thank you @leaannbradford for confirming the field that is being used

> Lea Ann Bradford (she/her):cherry_blossom:  3 minutes ago
Thanks for the update!
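
The snippet quoted in the thread calls `.text` directly on the XPath result, so a record without a timed log URL would raise, and a URL that no longer resolves yields an empty transcript without any warning. Below is a minimal sketch, assuming REXML (which the `child.elements[...]` call suggests), of reading the field defensively. The sample record, the `TIMED_LOG_XPATH` constant, and the `timed_log_url` helper are hypothetical and only illustrate the lookup; they are not taken from the application.

```ruby
require "rexml/document"

# Hypothetical sample record; real MODS records come from the OAI feed.
SAMPLE_MODS = <<~XML
  <mods:mods xmlns:mods="http://www.loc.gov/mods/v3">
    <mods:location>
      <mods:url usage="timed log">https://static.library.ucla.edu/oralhistory/text/masters/21198-zz00293fc4-2-master.xml</mods:url>
    </mods:location>
  </mods:mods>
XML

# Same XPath as the snippet quoted in the thread.
TIMED_LOG_XPATH = 'mods:location/mods:url[@usage="timed log"]'

# Returns the timed log URL for a MODS element,
# or nil when the element is missing or its text is blank.
def timed_log_url(child)
  node = child.elements[TIMED_LOG_XPATH]
  url = node&.text.to_s.strip
  url.empty? ? nil : url
end

record = REXML::Document.new(SAMPLE_MODS).root
url = timed_log_url(record)
if url
  puts "would fetch transcript from #{url}"
else
  warn "no timed log URL; skipping transcript indexing"
end
```

Logging the skipped case, rather than indexing an empty string, would make a changed or missing transcript URL visible at harvest time.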