kingjulio8238 / Memary

Making Agents Reliable In Production.
https://www.memarylabs.com

When Knowledge expands, wisdom shrinks #17

Closed · imvetri closed this 2 months ago

imvetri commented 2 months ago

Hello, I wanted to share my opinion.

Memary is based on knowledge graph expansion. Although there is a compression step in the pipeline, compressing the knowledge graph through data-storage techniques alone will not lead anywhere, nor will it stay ahead of alternatives.

Instead, by introducing a balance in the graph between knowledge and wisdom, more efficient storage can be developed.

An example of the difference: knowing more is knowledge, whereas wisdom is knowing less yet still being convincing. In terms of LLMs, knowledge-based communication involves a huge number of words, whereas a wisdom-based communication system uses fewer but stronger words.


kingjulio8238 commented 2 months ago

Hi @imvetri

Thank you for your insights. Regardless of the scale of the knowledge graph (the larger, the better), retrieving information always results in the formation of a subgraph centered around the main query entity, facilitating low-cost computation. This is highly beneficial for multi-hop reasoning. However, the knowledge graph itself does not undergo memory compression. The more comprehensive the graph, the greater the volume of accessible information for responding to queries.
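
To make that concrete, here is a minimal sketch of the retrieval pattern using networkx (an assumption for illustration; memary's actual graph backend and retrieval code differ): extracting the k-hop neighborhood around the query entity keeps the computation local no matter how large the full graph grows.

```python
import networkx as nx

# Toy knowledge graph; in practice this is the full graph store.
kg = nx.Graph()
kg.add_edges_from([
    ("Einstein", "relativity"),
    ("relativity", "spacetime"),
    ("spacetime", "gravity"),
    ("Einstein", "Nobel Prize"),
    ("Nobel Prize", "physics"),
])

def retrieve_subgraph(graph: nx.Graph, query_entity: str, hops: int = 2) -> nx.Graph:
    """Return the k-hop neighborhood centered on the query entity.

    Cost scales with the neighborhood, not the whole graph, which is
    what makes multi-hop reasoning over a large graph cheap.
    """
    return nx.ego_graph(graph, query_entity, radius=hops)

sub = retrieve_subgraph(kg, "Einstein", hops=2)
print(sorted(sub.nodes()))
# ['Einstein', 'Nobel Prize', 'physics', 'relativity', 'spacetime']
```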

The memory stream tracks users' exposure to various concepts - their 'wisdom' or breadth of knowledge - while the entity knowledge store assesses the depth of their understanding.
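
A rough sketch of how those two structures relate (the field names here are hypothetical, not memary's real schema): the memory stream is an append-only log of entity exposures, and the entity knowledge store aggregates it into per-entity counts.

```python
from collections import Counter
from datetime import datetime, timezone

# Memory stream: append-only log of every entity the user is exposed to
# (breadth of exposure, i.e. 'wisdom' in the sense discussed above).
memory_stream: list[tuple[str, datetime]] = []

def record_exposure(entity: str) -> None:
    memory_stream.append((entity, datetime.now(timezone.utc)))

# Entity knowledge store: per-entity frequency (depth of understanding).
def build_knowledge_store(stream: list[tuple[str, datetime]]) -> Counter:
    return Counter(entity for entity, _ in stream)

for e in ["relativity", "relativity", "spacetime", "relativity"]:
    record_exposure(e)

store = build_knowledge_store(memory_stream)
print(store.most_common(2))  # [('relativity', 3), ('spacetime', 1)]
```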

Currently, the only form of memory compression applies to the entity knowledge store, where we compress the top N entities. These are then provided to the LLM's finite context window with specific instructions to avoid detailed explanations, enhancing personalization for users already familiar with these concepts.
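
In miniature, the idea looks something like this (the prompt wording and function names are illustrative, not memary's actual strings):

```python
from collections import Counter

def compress_top_entities(store: Counter, n: int = 3) -> str:
    """Fold the N most frequent entities into a context-window hint."""
    top = [entity for entity, _count in store.most_common(n)]
    return (
        "The user is already familiar with: " + ", ".join(top) + ". "
        "Avoid detailed explanations of these concepts; address them succinctly."
    )

store = Counter({"relativity": 3, "spacetime": 1, "gravity": 1})
# This hint would be prepended to the LLM's finite context window.
print(compress_top_entities(store, n=2))
```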


We hope to see improvements to the memory compression, as the current implementation is fairly basic. One future contribution listed in the README: instead of compressing the most frequent entities (according to their counts), we could see improved results by compressing the entities that are included in the current query. This would let the LLM more effectively determine which concepts require detailed explanations and which can be addressed more succinctly in its response. We invite you to contribute to this development!
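
A hedged sketch of that suggested direction (entity extraction is stubbed out here; a real implementation would reuse memary's extraction pipeline): intersect the query's entities with the knowledge store instead of taking a global top N.

```python
from collections import Counter

def compress_query_entities(store: Counter, query_entities: set[str]) -> str:
    """Compress only the known entities that appear in the current query,
    so the hint stays relevant to what the user is asking right now."""
    known = [e for e in query_entities if store[e] > 0]
    if not known:
        return ""
    return (
        "Of the concepts in this query, the user already knows: "
        + ", ".join(sorted(known))
        + ". Explain only the unfamiliar ones in depth."
    )

store = Counter({"relativity": 3, "spacetime": 1})
# In practice these would come from an entity extractor run over the query.
print(compress_query_entities(store, {"spacetime", "wormholes"}))
```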

We also have compression for the chat history using an eviction policy, but that's a bit outside the scope of this discussion. Hope this helps.
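
For completeness, a minimal sketch of one possible eviction policy (FIFO is purely an assumption here; the actual policy in the repo may differ):

```python
from collections import deque

# Keep only the most recent messages; older ones are evicted first-in-first-out.
MAX_MESSAGES = 4
chat_history: deque[dict] = deque(maxlen=MAX_MESSAGES)

for i in range(6):
    chat_history.append({"role": "user", "content": f"message {i}"})

# Messages 0 and 1 have been evicted.
print([m["content"] for m in chat_history])
# ['message 2', 'message 3', 'message 4', 'message 5']
```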
