[COG-538] Identifying industry standard retrieval algorithms to implement

hajdul88 commented 6 hours ago

I will double check when implementing the retrievers. To include it everywhere would be a bigger task, will see what is the state and update beginning of next week.

hajdul88 commented 6 hours ago

I thought it was there, maybe it got dropped in some refactor. Please include it if you can. Our initial concept I did had it

hajdul88 commented 6 hours ago

Here I was thinking and reading research papers. I think we will need to have embeddings of each nodes as well as each edges. Most of the non LLM based retrievals rely on node and edge importances that are ultimately vector similarities to the query. As I know for now we have embeddings only for the documentchunks.

For this cycle I can do embeddings of these on the fly but its not gonna be production ready but will work for testing purposes, later these will have to be moved to cognify pipeline and done when we are creating the graph.

@lazar @Vasilije1990 what do you think?

topoteretes / cognee

[COG-538] Identifying industry standard retrieval algorithms to implement #225