digitalutsc / islandora_lite_docs

Contains a Wiki with documentation for the UTSC Library's Islandora Lite System
GNU General Public License v3.0

Indexing WebVTTs in Solr Search #11

Closed by kstapelfeldt 8 months ago

kstapelfeldt commented 3 years ago

Do we have a very strong use case for fieldable files?

kstapelfeldt commented 2 years ago

Hi @Natkeeran I thought this one was done, or in review. Let us know if you have any more details.

Natkeeran commented 2 years ago

@kstapelfeldt https://github.com/Natkeeran/webvtt There is some code in my local environment that I still need to review, test, and commit.

amym-li commented 1 year ago

The repo containing the current development work for structurally indexing WebVTTs can be found at https://github.com/digitalutsc/solr_vtt

The readme contains instructions for indexing files as fulltext fields using Search API Attachments and File Extractor, as well as information for indexing nested documents in Solr.
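To make the nested-document idea concrete, here is a minimal sketch in Python. It is not the solr_vtt implementation. It parses a WebVTT transcript into cues and builds one parent document with one child document per cue, so a search hit could point at a timestamp. The field names (`title_t`, `start_f`, `end_f`, `cue_text_t`), the `!cueN` id scheme, and the helper names are all hypothetical; `_childDocuments_` is the key Solr's JSON update format accepts for child documents.

```python
import re

# Matches a WebVTT cue timing line, e.g. "00:00:01.000 --> 00:00:04.000".
# The hours component is optional in WebVTT.
TIMING = re.compile(
    r"^((?:\d{2,}:)?\d{2}:\d{2}\.\d{3})\s+-->\s+((?:\d{2,}:)?\d{2}:\d{2}\.\d{3})"
)


def to_seconds(timestamp):
    """Convert 'hh:mm:ss.ttt' or 'mm:ss.ttt' to seconds."""
    parts = timestamp.split(":")
    seconds = float(parts[-1])
    minutes = int(parts[-2])
    hours = int(parts[-3]) if len(parts) == 3 else 0
    return hours * 3600 + minutes * 60 + seconds


def parse_cues(vtt_text):
    """Return a list of cue dicts with start and end (in seconds) and text."""
    cues = []
    normalized = vtt_text.replace("\r\n", "\n").strip()
    # Blocks in a WebVTT file are separated by blank lines.
    for block in re.split(r"\n\s*\n", normalized):
        lines = block.split("\n")
        for i, line in enumerate(lines):
            match = TIMING.match(line.strip())
            if match:
                # Everything after the timing line is the cue payload.
                text = " ".join(l.strip() for l in lines[i + 1:] if l.strip())
                cues.append({
                    "start": to_seconds(match.group(1)),
                    "end": to_seconds(match.group(2)),
                    "text": text,
                })
                break  # blocks without a timing line (header, NOTE) are skipped
    return cues


def build_nested_doc(parent_id, title, vtt_text):
    """Build a parent document with one child document per cue.

    All field names here are assumptions, not the solr_vtt schema.
    """
    children = [
        {
            "id": f"{parent_id}!cue{n}",
            "start_f": cue["start"],
            "end_f": cue["end"],
            "cue_text_t": cue["text"],
        }
        for n, cue in enumerate(parse_cues(vtt_text))
    ]
    return {"id": parent_id, "title_t": title, "_childDocuments_": children}
```

The resulting dictionary could be serialized with `json.dumps` and posted to a Solr update handler. Whether the target schema accepts these fields depends on the site's Solr configuration.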

kstapelfeldt commented 1 year ago

@amym-li showed us how to do this. We couldn't get structured indexing out of it. It still needs to be implemented in SOHP.

amym-li commented 1 year ago

Fulltext indexing of WebVTTs is now set up on SOHP:

https://sohp2.digital.utsc.utoronto.ca/collection/3?a%5B0%5D%5Bf%5D=all&a%5B0%5D%5Bi%5D=IS&a%5B0%5D%5Bv%5D=Burlington

https://stories.digital.utsc.utoronto.ca/node/3?a%5B0%5D%5Bf%5D=all&a%5B0%5D%5Bi%5D=IS&a%5B0%5D%5Bv%5D=Burlington

kylehuynh205 commented 9 months ago

Needs documentation.

Natkeeran commented 8 months ago

Added documentation here: https://github.com/digitalutsc/islandora_lite_docs/wiki/2.-Basic-and-Advanced-Search#indexing-webvtt

We have a basic approach configured on the stories site.