Build a RAG agent - Githubissues

KenjiPcx / IdeaGen

0 stars 0 forks source link

Build a RAG agent #5

Open KenjiPcx opened 6 days ago

KenjiPcx commented 6 days ago

The main power feature is in this agent, it will

Take in a user problem
Build a step by step how to build a startup document to solve this problem, containing
- The tools that could be used
- The team, number of people
- The reddit post url as reference

This is where we try to fit as much nvidia products as possible. I prefer using langgraph if possible

Use Nvidia inference engine and choose their latest NVLM model, i think D is the powerful one?
User enters problem they're trying to solve
RAG pipeline:
- enrich search query (query expansion)
- embed search query using same embedding model as the one used in llamaindex
- reranking results (which result is actually useful)
Summarize results