org-SCAN / website

Site web du projet
6 stars 0 forks source link

[NEW FEATURE]Improve duplicate algorithm #466

Closed lduf closed 6 months ago

lduf commented 9 months ago

Find new algorithms to improve our duplicate detection. You can use ChatGPT to find some new ideas. Once done => Implement the solution within the Laravel project.

I think the best way to do :

=> this way you ll have the same architecture and the same function.

You ll also need to adapt the Duplicate model to call the newly created MatchingAlgorithm

github-actions[bot] commented 9 months ago

AI-Assisted Response:

Issue Summary: Enhancing Duplicate Detection with New Algorithms

The issue focuses on improving the duplicate detection system in a Laravel project by finding and integrating new algorithms. The proposal includes using resources like ChatGPT to brainstorm innovative algorithmic ideas. After identifying suitable algorithms, the next step is to implement those within the Laravel framework.

The suggested implementation strategy is to organize the codebase using object-oriented design principles:

  1. Create an abstract base class named MatchingAlgorithm.
  2. Derive a child class for each specific algorithm from this parent class.

This structure will provide consistency across different algorithms and facilitate straightforward integration with the system. The issue also mentions the need to update the Duplicate model in order to utilize the new MatchingAlgorithm base class.

Issue Categorization

This issue is classified as a feature request since it calls for new functionality—improved duplicate detection algorithms—to be added to the existing software.

Questions for Clarification

  1. @lduf, could you please provide more context on the current duplicate detection system? What algorithms are in use, and what are their limitations?
  2. Have you already identified specific algorithms that you would like to explore, or are you looking for suggestions from scratch?
  3. What criteria will you use to evaluate the new algorithms' effectiveness in duplicate detection?
  4. Is there a specific workflow or data structure the Duplicate model uses that the new algorithms should be compatible with?
  5. Could you clarify if there is an expected performance benchmark for the new algorithms compared to the current system?
  6. Is there a timeline for when you plan to have the new algorithms researched and potentially implemented?
github-actions[bot] commented 9 months ago

AI-Assisted Response:

Issue Summary: Enhancement of Duplicate Detection with New Algorithms

The issue suggests that there is potential for improving the system's ability to detect duplicates by introducing new algorithms. The user recommends utilizing ChatGPT for generating innovative ideas for such algorithms. Once identified, the proposed solution should be integrated into an existing Laravel project.

The user outlines a potential implementation strategy involving object-oriented programming:

Issue Categorization:

This is a feature request, as it proposes the addition of new functionality to improve an already present system within the project.

Questions for Clarification:

lduf commented 9 months ago

A potential solution :

Other phonetic algorithms :

Other algorithms to calculate distances :

create-issue-branch[bot] commented 9 months ago

Branch feature/issue-466-_NEW_FEATURE_Improve_duplicate_algorithm created!