gkamradt / SemanticDeduplicator

MIT License
87 stars, 12 forks

Merging Strategy for Multiple Similar Items #31

Open ayush-vibrant opened 1 year ago

ayush-vibrant commented 1 year ago

I am not entirely clear on the current merging strategy when multiple similar items match. I've read the code, but I still have a few doubts.

In my opinion, the current system merges similar items efficiently, but the merged output could be richer and more nuanced, especially when several existing items closely match a new addition.

PS: I think this is currently handled indirectly (by proxy) by tuning the threshold values, right?
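To make the question concrete, here is a minimal sketch of the situation I have in mind. It is not the repository's code: `cosine_similarity`, `find_matches`, the toy vectors, and the threshold values are all hypothetical, and I am assuming items are compared by cosine similarity over embeddings. It only shows how the threshold controls how many existing items count as matches for a new one.

```python
import math

# Hypothetical helper, not part of SemanticDeduplicator.
def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Hypothetical helper: return every existing item whose similarity to the
# new item is at or above the threshold, best match first.
def find_matches(new_vec, existing, threshold):
    scored = [(name, cosine_similarity(new_vec, vec)) for name, vec in existing.items()]
    matches = [(name, score) for name, score in scored if score >= threshold]
    return sorted(matches, key=lambda pair: pair[1], reverse=True)

# Toy 2-D "embeddings" chosen for illustration only.
existing = {
    "item_a": [1.0, 0.0],
    "item_b": [0.8, 0.6],
    "item_c": [0.0, 1.0],
}
new_item = [1.0, 0.2]

# A strict threshold leaves a single candidate to merge with.
strict = [name for name, _ in find_matches(new_item, existing, 0.95)]

# A looser threshold yields several candidates, which is the case
# where the merging strategy is unclear to me.
loose = [name for name, _ in find_matches(new_item, existing, 0.90)]

print(strict)
print(loose)
```

With the strict threshold only one item qualifies, so the merge target is unambiguous. With the looser one, two items qualify, and the question is whether the new item is merged with only the best match or with all of them.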