We calculate a topic distribution for each document(1) and calculate topic distribution for the given input text (2) with keywords. We need to find similar documents in the matrix of topic/document distribution(1) using the calculated distribution of the given input(2). We used cosine distance in Milestone_1_W_Relevant_Data and Milestone_1 to find similar documents.
Document how this logic works with clear matrix examples:
We calculate a topic distribution for each document(1) and calculate topic distribution for the given input text (2) with keywords. We need to find similar documents in the matrix of topic/document distribution(1) using the calculated distribution of the given input(2). We used cosine distance in Milestone_1_W_Relevant_Data and Milestone_1 to find similar documents.
Document how this logic works with clear matrix examples:
TODOS: