Open itsrachelfish opened 9 years ago
this sounds like a job for FUZZY HASHING
Nah, easier solved with "stripColorsAndStyle" from https://github.com/fent/irc-colors.js
@edwin-pers Your "stripColorsAndStyle" solution would not solve the example given.
@le1ca Thank you for the tip, I found a fuzzy hashing lib for node.js: https://github.com/huwenshuo/ctph.js
:+1:
By including special characters (color codes, bold, etc.) or only making small changes like adding a space or exclamation mark, it is possible to bypass fishy's triplicate detection.
Fishy should strip special characters from messages and do a text comparison of the most recent lines to make sure they don't have repeating sections. For example, the following messages should trigger triplicate detection even though they aren't exact matches: