Closed Arko-Sengupta closed 2 months ago
@Arko-Sengupta, this pull request is being closed as none of the checkboxes have been marked. It is important that you go through the checklist and mark the ones relevant to this pull request. Please read the Contributing guidelines.
If you're facing any problem on how to mark a checkbox, please read the following instructions:
x
between the square bracket like so: [x]
NOTE: Only [x]
is supported so if you have put any other letter or symbol between the brackets, that will be marked as invalid. If that is the case then please open a new pull request with the appropriate changes.
Jaccard Similarity Algorithm
Overview
Introduces a New Implementation of the Jaccard Similarity Algorithm in the
JaccardSimilarity
class. The Jaccard Similarity is a classical metric used inNatural Language Processing
andInformation Retrieval
to measure the similarity between two sets based on their intersection and union.Key Features
Mathematical Foundation
Jaccard Similarity Formula:
where the result is expressed as a percentage, with 100% indicating identical sets and 0% indicating no overlap.
Usage
JaccardSimilarity
class provides a method to calculate the similarity between two strings. It includes:jaccard_similarity(str1, str2)
: Computes the Jaccard similarity between two input strings as a percentage.Error Handling
Robust Error Handling is implemented to ensure reliable calculations. Any issues, such as empty input strings, are raised with appropriate error messages and logged.
Benefits