Charcoal-SE / SmokeDetector

Headless chatbot that detects spam and posts links to it in chatrooms for quick deletion.
https://metasmoke.erwaysoftware.com
Apache License 2.0
474 stars 182 forks source link

Incorrect detection of Non-English link in answer #248

Closed rschrieken closed 7 years ago

rschrieken commented 8 years ago

This answer was reported as Non English Link in Answer but I fail to see where the non-english is. The markup of the post was:

https://jsfiddle.net/TC2006/vcfjhg4m/[][1]

    Just something I made in JSFiddle.

    Hope it helps. :)

  [1]: https://jsfiddle.net/TC2006/vcfjhg4m/

Can this be fixed or explained why it qualifies for being reported?

angussidney commented 8 years ago

Link to reason in code, for reference:

https://github.com/Charcoal-SE/SmokeDetector/blob/master/findspam.py#L59

csnardi commented 8 years ago

The non-English link text here is an empty link text. I looked at this a while back; it's only been detected a couple times in the past few months, and it does seem somewhat strange to have no link text, so maybe this is a decent idea to detect. I'm not sure though.

rschrieken commented 8 years ago

@hichris1234 I'm not saying it shouldn't be detected but I had to hit edit to check the markdown to verify that there was not something hiding in there. So I'm fine with detecting this empty link text but I would love to not have that called non-english although technically empty is also non-english...

ArtOfCode- commented 7 years ago

I'm gonna close this, because we haven't had any recurrences.

angussidney commented 7 years ago

And I'm going to reopen this, because we literally just had another occurence of this about 20 minutes after you closed it:

http://chat.stackexchange.com/transcript/message/33657866#33657866

ArtOfCode- commented 7 years ago

Typical. Inconsiderate spammers, no thought for the common spam-detectors.

Bhargav-Rao commented 7 years ago

And another one https://metasmoke.erwaysoftware.com/post/47517