Open hortongn opened 1 year ago
- https://www.aiforlibrarians.com/ai-cases/
Next steps:
Consider making use of the metadata tags that may already be embedded in a document (PDF, Word, etc.)
An AI toolkit for libraries (paper) https://insights.uksg.org/articles/10.1629/uksg.592
Integrating Ruby with OpenAI: A Beginner’s Guide https://ai.plainenglish.io/integrating-ruby-with-openai-a-beginners-guide-88ffaa10f202
GPT-JT is an open source GPT-3 alternative with a decentralized approach https://the-decoder.com/gpt-jt-is-an-open-source-gpt-3-alternative-with-a-decentralized-approach/
How to use Microsoft AI Builder to Extract Data from PDF https://www.youtube.com/watch?v=J3d6bx3i4l0&ab_channel=KevinStratvert
MS PowerAutomate (part of Office 365) https://powerautomate.microsoft.com
Interesting: Text Analytics APIs are machine learning-powered services that allow developers to analyze and extract insights from text-based data. These APIs use natural language processing (NLP) techniques to automatically identify and extract entities, sentiments, topics, and other relevant information from text.
Here's a high-level overview of how Text Analytics APIs work:
Some popular Text Analytics APIs include:
By using Text Analytics APIs, developers can leverage the power of machine learning to extract valuable insights from text-based data with minimal effort and expertise.
Create a list of models that we could potentially use to extract text from documents and suggest metadata. We will start with basic metadata like title, description, etc. and eventually move on to optional metadata fields found in Scholar.
We ideally want to use "machine learning as a service" options that will host things for us, but we can also explore open source options.