Closed sashatrubetskoy closed 2 years ago
After thinking about it some more, I realized that allowing a duplicate page strictness slider opens a can of worms - since other results are dependent on the results of the duplicate pages. This means we would either have to re-run the python script if the threshold is changed, or we would have to use some fragile logic to filter results coming from the python, which would take a lot of time to develop. In any case, it is probably not worth the effort right now.
Settings panel should have the following structure:
Page
Show: [ ] 123 total matches
Text
Show: [ ] 2 total matches Min Text Characters: 1 Min Text Words: 1
Digit
Show: [ ] 22 total matches Min Digit Count: 1
Image
Show: [ ] 9 total matches
Importance Score
Min Score: 5