Living-with-machines / DeezyMatch

A Flexible Deep Learning Approach to Fuzzy String Matching
https://living-with-machines.github.io/DeezyMatch/

Fix ranking metric documentation in candidateRanker #97

Closed mcollardanuy closed 2 years ago

mcollardanuy commented 3 years ago

Check lines 328-331 in candidateRanker.py. The comment and the code disagree: the comment says the threshold is multiplied by 0.99, but the code multiplies it by 1.01. Change the comment accordingly:

elif ranking_metric.lower() in ["cosine"]:
    # 0.99 is multiplied to avoid issues with float numbers and rounding errors
    if query_candidate_pd["cosine_dist"].max() > (selection_threshold*1.01):
        break
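
To illustrate why the factor in the code (1.01) is the one the comment should describe, here is a minimal sketch of the same comparison. The function name `exceeds_threshold` and the `tolerance` parameter are hypothetical and not part of DeezyMatch, and a plain list stands in for the `cosine_dist` column:

```python
def exceeds_threshold(cosine_dists, selection_threshold, tolerance=1.01):
    """Return True if the largest distance is above the threshold plus slack.

    Hypothetical helper for illustration only; it mirrors the comparison
    quoted above but is not taken from candidateRanker.py.
    """
    # 1.01 is multiplied to avoid issues with float numbers and rounding errors
    return max(cosine_dists) > selection_threshold * tolerance


# 0.1 + 0.2 is stored as 0.30000000000000004, slightly above 0.3.
borderline = [0.1 + 0.2]

# Without slack, the rounding error alone trips the check.
print(exceeds_threshold(borderline, 0.3, tolerance=1.0))

# With the 1.01 factor, the borderline value is tolerated.
print(exceeds_threshold(borderline, 0.3))

# A value clearly above the threshold still trips the check.
print(exceeds_threshold([0.1, 0.31], 0.3))
```

A factor of 0.99 would lower the effective threshold and make the check stricter, so it would not absorb rounding errors. That suggests the code is right and only the comment needs to change.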
kasra-hosseini commented 2 years ago

Done in https://github.com/Living-with-machines/DeezyMatch/pull/121