This branch contains the implementation of search-result deduplication.
The function deduplicate_search_results() in merger.py receives the final search result from main.py.
Currently it only handles publications: it sets a similarity threshold, vectorizes each publication using its title and authors, and returns the publications that remain after comparing their similarity scores against the threshold.
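
The steps described above can be illustrated with a minimal sketch. It is not the code in merger.py: the function names, the threshold value of 0.8, the dictionary layout with "title" and "authors" keys, and the use of bag-of-words vectors with cosine similarity are all assumptions made for illustration. The branch may vectorize and compare publications differently.

```python
import math
import re
from collections import Counter

# Hypothetical threshold; the value used in merger.py is not stated here.
SIMILARITY_THRESHOLD = 0.8


def vectorize(publication):
    """Turn a publication's title and authors into a bag-of-words vector."""
    text = publication["title"] + " " + " ".join(publication["authors"])
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    return Counter(tokens)


def cosine_similarity(a, b):
    """Cosine similarity between two sparse term-count vectors."""
    dot = sum(count * b[term] for term, count in a.items())
    norm_a = math.sqrt(sum(c * c for c in a.values()))
    norm_b = math.sqrt(sum(c * c for c in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def deduplicate_sketch(publications, threshold=SIMILARITY_THRESHOLD):
    """Keep a publication only if it is not too similar to one already kept."""
    kept = []
    kept_vectors = []
    for publication in publications:
        vector = vectorize(publication)
        # A publication counts as a duplicate when its similarity to any
        # previously kept publication reaches the threshold.
        if all(cosine_similarity(vector, v) < threshold for v in kept_vectors):
            kept.append(publication)
            kept_vectors.append(vector)
    return kept
```

In this sketch the first occurrence of a publication is kept and later near-duplicates are dropped, so the order of the input list decides which copy survives. Raising the threshold makes the filter stricter about what counts as a duplicate; lowering it removes more entries.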