Added account deactivation on spam detection of profile/bio updates

CodexVeritas commented 17 hours ago

Issue: https://github.com/Metaculus/metaculus/issues/1386 New spam detection flow:

Someone makes an account
They come to edit their profile
When they submit a profile edit (i.e. with the bio section) the information is passed to a spam check
The spam check always passes if the user has been around for at least 7 days
Otherwise GPT Analyzes the bio and decides whether its spam
If spam is detected, there is an alert given the the user on the front end, an email is sent, and the user is logged out after 10sec

Identification rates:

I tested on the ~300 profiles that has at least one comment or prediction and has a bio. The spam detection getting 16% positives on these. A number of these were spam in and of themselves, so I'm putting the true false-positive rate around 2-7%. The first iteration was at ~35%.
I tested on a smaller set of verified spam and the identification is getting 100% on the spam profiles

CodexVeritas commented 17 hours ago

@lsabor You mentioned that the 2-10% false positive rate was too high. Should I shoot for a 1-3%? There are some profiles which are hard to distinguish from spam. I'm guessing so few people change their bio that this is probably an ok rate (and people will understand why they were flagged when they are). Only 300 have bios and a comment/prediction of our 210,000 users (granted probably ~50% of those users are spam).

For reference, we technically have 210k users, 71k users with bios, 25k users have forecasted. Only ~300 have bios AND have forecasted/commented. Given this, I'm guessing that ~50% of users are spam. Over the lifetime of Metaculus this would have resulted in 6-30 false deactivations who could email us to remedy the situation (probably 50% from people doing something stupid like throwing html into their bio, or making something legit sound fishy). And actually this would be more like 0-10 since they might have changed their bios after the 7 day range

Though more iterations couldn't hurt, just curious how much of a difference it will make since they can reach out if there is a problem.

CodexVeritas commented 17 hours ago

Also looks like I need to apply flake8 formatting. Let me fix that real quick.

CodexVeritas commented 17 hours ago

@lsabor Who would I invite to request review? I should invite Sylvain for the final QA review because he assigned the ticket to me, but for the preliminary review, will just anyone from the team do?

Metaculus / metaculus

Added account deactivation on spam detection of profile/bio updates #1484