relationships - Githubissues

OSU-NLP-Group / HippoRAG

HippoRAG is a novel RAG framework inspired by human long-term memory that enables LLMs to continuously integrate knowledge across external documents. RAG + Knowledge Graphs + Personalized PageRank.

MIT License

1.23k stars 100 forks source link

In retrieval of the most relevant documents, you use the knowledge graph ( with noun_phrases as nodes and E + Ep edges) and you use P matrix ( noun_phrases occurrences in passages) and Embeddings of noun_phrases.

The retrieval process extracts entities from query and map them to entities in the KG and run PPR (personalized PageRank) to adjust the probabilities of the most relevant nodes of the KG that are important for answering the query. Then, you use this adjusted probability vector and multiply it to P matrix to rank the passages for passing to the LLM for drafting the final answer to the query from fetched passages.

The question is, (1) why didn't you use relationships in triples in constructing the KG so that later you can extract both entities and relationships from query and search through the graph with considering both entities and relationships?

(2) Is there any recent research paper that does (1)?

OSU-NLP-Group / HippoRAG

relationships #38