Remove season references from the end:
regexp.MustCompile(`(?i) (s|season|season ){1}\d{1,2}e?\d{0,2}$`)
Similarity
Use github.com/hbollon/go-edlib with the following measures:
OSADamerauLevenshtein
Lcs
Cosine
Jaccard
SorensenDice
Qgram
Apply similarity targets to the min, median and max of these measures.
This reduces the false positives and false negatives that come from using a fixed Levenshtein distance of 5.
false negatives: correct matches with distance > 5
false positives: incorrect matches with distance < 5
interaction of measures
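A minimal sketch of how the min/median/max targets could be applied. It assumes each measure has already been turned into a similarity score between 0 and 1 (go-edlib's StringsSimilarity returns a float32 of that kind, as far as I know). The Summary and Targets types and the threshold values are hypothetical names chosen for illustration, not part of bitmagnet or go-edlib.

```go
package main

import (
	"fmt"
	"sort"
)

// Summary holds the min, median and max of a set of similarity scores.
type Summary struct {
	Min, Median, Max float32
}

// summarize returns the min, median and max of scores.
// The boolean is false when scores is empty.
func summarize(scores []float32) (Summary, bool) {
	if len(scores) == 0 {
		return Summary{}, false
	}
	// Sort a copy so the caller's slice is left untouched.
	s := append([]float32(nil), scores...)
	sort.Slice(s, func(i, j int) bool { return s[i] < s[j] })

	mid := len(s) / 2
	median := s[mid]
	if len(s)%2 == 0 {
		median = (s[mid-1] + s[mid]) / 2
	}
	return Summary{Min: s[0], Median: median, Max: s[len(s)-1]}, true
}

// Targets are hypothetical lower bounds for each summary statistic.
type Targets struct {
	Min, Median, Max float32
}

// Accept reports whether every statistic reaches its target.
func (t Targets) Accept(s Summary) bool {
	return s.Min >= t.Min && s.Median >= t.Median && s.Max >= t.Max
}

func main() {
	// Assumed example scores, one per measure (OSADamerauLevenshtein, Lcs, ...).
	scores := []float32{0.5, 0.75, 1.0, 0.25}
	summary, ok := summarize(scores)
	if !ok {
		fmt.Println("no scores")
		return
	}
	targets := Targets{Min: 0.2, Median: 0.6, Max: 0.9} // illustrative values only
	fmt.Println(summary, targets.Accept(summary))
}
```

Requiring all three statistics to reach a target means one unusually high or low measure cannot decide the outcome on its own.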
Proposed solution
Change bitmagnet to have a configuration option under which a proxy is trusted. The proxy must have these similarity checks built in and return a single result in the array only if that result passes the similarity checks outlined above.
Is your feature request related to a problem? Please describe
I have found the following with TMDB search:
The query attribute works better if pre-cleaned, both for getting search results and for performing similarity checks
distance.levenshtein
Describe the solution you'd like
Cleaning
regexp.MustCompile(`^[a-zA-Z\d ]* ([a-z]{2,3}|Com|COM|TO|NZ|Org) [- ]{1}`)
regexp.MustCompile("(?i)[^\x00-\x7F]+")
regexp.MustCompile(`(?i) (s|season|season ){1}\d{1,2}e?\d{0,2}$`)
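The three expressions above could be chained roughly as follows. This is a sketch, not bitmagnet code: the cleanTitle function, the variable names, the order of the steps and the sample title are my own assumptions. The non-ASCII pattern is written as a raw string here so that the regexp engine interprets the escapes itself.

```go
package main

import (
	"fmt"
	"regexp"
	"strings"
)

var (
	// Leading site prefix, for example "www Example com -".
	sitePrefix = regexp.MustCompile(`^[a-zA-Z\d ]* ([a-z]{2,3}|Com|COM|TO|NZ|Org) [- ]{1}`)
	// Runs of non-ASCII characters.
	nonASCII = regexp.MustCompile(`(?i)[^\x00-\x7F]+`)
	// Trailing season or episode marker, for example " S01E02" or " season 3".
	seasonSuffix = regexp.MustCompile(`(?i) (s|season|season ){1}\d{1,2}e?\d{0,2}$`)
)

// cleanTitle strips the site prefix, non-ASCII runs and a trailing season
// reference, then trims surrounding whitespace.
func cleanTitle(title string) string {
	title = sitePrefix.ReplaceAllString(title, "")
	title = nonASCII.ReplaceAllString(title, "")
	title = seasonSuffix.ReplaceAllString(title, "")
	return strings.TrimSpace(title)
}

func main() {
	fmt.Println(cleanTitle("www Example com - Some Show S01E02")) // prints "Some Show"
}
```

The cleaned string would then be used both as the search query and as the left-hand side of the similarity checks.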
levenshteinCheck() is only applied if SimilarityCheck is true.
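To illustrate the gating and why a fixed distance of 5 misfires, here is a self-contained sketch. It is not bitmagnet's implementation: the Config struct, the MaxDistance field, the accept function and the "distance <= 5 passes" rule are assumptions, and the Levenshtein function is a plain textbook version written for this example.

```go
package main

import "fmt"

// min3 returns the smallest of three ints.
func min3(a, b, c int) int {
	if b < a {
		a = b
	}
	if c < a {
		a = c
	}
	return a
}

// levenshtein returns the edit distance (insert, delete, substitute)
// between a and b, compared rune by rune.
func levenshtein(a, b string) int {
	ra, rb := []rune(a), []rune(b)
	prev := make([]int, len(rb)+1)
	for j := range prev {
		prev[j] = j
	}
	for i := 1; i <= len(ra); i++ {
		cur := make([]int, len(rb)+1)
		cur[0] = i
		for j := 1; j <= len(rb); j++ {
			cost := 1
			if ra[i-1] == rb[j-1] {
				cost = 0
			}
			cur[j] = min3(prev[j]+1, cur[j-1]+1, prev[j-1]+cost)
		}
		prev = cur
	}
	return prev[len(rb)]
}

// Config is a hypothetical configuration; only SimilarityCheck is named in the issue.
type Config struct {
	SimilarityCheck bool
	MaxDistance     int // assumed threshold, 5 in the discussion above
}

// accept skips the distance check entirely when SimilarityCheck is false.
func accept(cfg Config, query, title string) bool {
	if !cfg.SimilarityCheck {
		return true
	}
	return levenshtein(query, title) <= cfg.MaxDistance
}

func main() {
	cfg := Config{SimilarityCheck: true, MaxDistance: 5}
	// Different short titles, distance 1: accepted (a false positive).
	fmt.Println(accept(cfg, "Up", "Us"))
	// Same film with and without articles, distance 8: rejected (a false negative).
	fmt.Println(accept(cfg,
		"The Lord of the Rings The Fellowship of the Ring",
		"Lord of the Rings Fellowship of the Ring"))
}
```

An absolute distance ignores title length, which is why short titles pass too easily and long titles fail too easily.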