KenjiPcx / IdeaGen

0 stars 0 forks source link

Build a RAG agent #5

Open KenjiPcx opened 6 days ago

KenjiPcx commented 6 days ago

The main power feature is in this agent, it will

This is where we try to fit as much nvidia products as possible. I prefer using langgraph if possible

  1. Use Nvidia inference engine and choose their latest NVLM model, i think D is the powerful one?
  2. User enters problem they're trying to solve
  3. RAG pipeline:
    • enrich search query (query expansion)
    • embed search query using same embedding model as the one used in llamaindex
    • reranking results (which result is actually useful)
  4. Summarize results