neomatrix369 / learning-path-index

A repo with data files, assets and code supporting and powering the Learning Path Index Project
MIT License
16 stars 17 forks source link

Project 2: Create a RAG + Vector DB + LLM pipeline #70

Open TobeTek opened 2 months ago

TobeTek commented 2 months ago

Create a pipeline that integrates Retrieval-Augmented Generation (RAG) based on the LPI dataset, a vector database, and a Large Language Model (LLM).

Tacoman99 commented 1 month ago

Notebook showing a example using advance rag techniques such has auto retrieval and hybrid search on the LPI Dataset using Llama Index, Gemini and weaviate vector database

Findings: Rag is able to provide the correct courses from a users query but the Gemma2b model struggles on returning a desired output as seen in the the notebook. Copying the fewshot templates inputting infomation instead of using the contextual infomation Next Steps:

https://www.kaggle.com/code/tacoman789/gemma-few-shot-learning-with-llamaindex?scriptVersionId=202917060

Tacoman99 commented 3 weeks ago

2nd version

https://www.kaggle.com/code/tacoman789/gemma-few-shot-learning-with-llamaindex/notebook#Testing-a-different-prompt