topoteretes / cognee

Reliable LLM Memory for AI Applications and AI Agents
https://www.cognee.ai
Apache License 2.0
882 stars, 70 forks

[COG-538] Identifying industry standard retrieval algorithms to implement #225

Open hajdul88 opened 6 hours ago

hajdul88 commented 6 hours ago

From SyncLinear.com | COG-538

hajdul88 commented 6 hours ago

I will double-check when implementing the retrievers. Including it everywhere would be a bigger task; I will see what the current state is and post an update at the beginning of next week.

hajdul88 commented 6 hours ago

I thought it was there; maybe it got dropped in some refactor. Please include it if you can. The initial concept I did had it.

hajdul88 commented 6 hours ago

I have been thinking about this and reading research papers. I think we will need embeddings for each node as well as each edge. Most of the non-LLM-based retrieval methods rely on node and edge importances that are ultimately vector similarities to the query. As far as I know, for now we only have embeddings for the documentchunks.

For this cycle I can compute these embeddings on the fly. That is not going to be production ready, but it will work for testing purposes. Later these will have to be moved into the cognify pipeline and computed when we are creating the graph.
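To make the idea concrete, here is a rough sketch of scoring nodes and edges by vector similarity to the query, with embeddings computed on the fly. Everything in it is an assumption for illustration: `toy_embed` is a hashed bag-of-words stand-in for a real embedding model, `rank_by_similarity` is a hypothetical helper, and the text used to describe each node and edge is made up. None of this is the cognee API.

```python
import hashlib
import math
from typing import Callable, Dict, List, Tuple

Vector = List[float]


def toy_embed(text: str, dim: int = 64) -> Vector:
    """Stand-in embedder (hashed bag of words). A real setup would call an embedding model."""
    vec = [0.0] * dim
    for token in text.lower().split():
        # Map each token to a fixed bucket so the same text always gives the same vector.
        idx = int(hashlib.md5(token.encode("utf-8")).hexdigest(), 16) % dim
        vec[idx] += 1.0
    return vec


def cosine(a: Vector, b: Vector) -> float:
    """Cosine similarity; returns 0.0 if either vector is all zeros."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def rank_by_similarity(
    query: str,
    items: Dict[str, str],
    embed: Callable[[str], Vector] = toy_embed,
    top_k: int = 3,
) -> List[Tuple[str, float]]:
    """Embed the query and every item on the fly, then return the top_k (id, score) pairs."""
    query_vec = embed(query)
    scored = [(item_id, cosine(query_vec, embed(text))) for item_id, text in items.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_k]


if __name__ == "__main__":
    # Hypothetical graph content: nodes keyed by id, edges flattened to "source relation target".
    nodes = {
        "n1": "document chunk about graph storage",
        "n2": "entity describing llm memory retrieval",
    }
    edges = {
        "e1": "n2 is_mentioned_in n1",
        "e2": "llm memory is_stored_in graph storage",
    }
    print(rank_by_similarity("llm memory retrieval", nodes))
    print(rank_by_similarity("llm memory retrieval", edges))
```

Here every item is re-embedded on each query, which is the part that would not scale. Once the embeddings are produced in the pipeline at graph creation time, `rank_by_similarity` would only need to embed the query and look up stored vectors.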

@lazar @Vasilije1990 what do you think?