gkamradt / SemanticDeduplicator

MIT License
87 stars 12 forks source link

Auto-Generate `background_context` Using LLMs for Better Deduplication #30

Open ayush-vibrant opened 1 year ago

ayush-vibrant commented 1 year ago

While the system currently allows the usage of the API without providing a background_context, this might not always yield optimal results, especially when the context is crucial for deduplication. (You've already highlighted this in the README.md)

As a fallback for situations where the background_context is not provided - should we consider implementing a mechanism that utilizes LLMs to infer or auto-generate a suitable context based on the items in the list? This inferred context can assist in achieving better deduplication outcomes.