SOBotics / Guttenberg

A bot, searching for plagiarism on Stack Overflow.
http://stackapps.com/q/7197/43403
GNU General Public License v3.0
23 stars 10 forks source link

Google-search: Print on-site matches via SoBoticsPostPrinter #145

Open FelixSFD opened 6 years ago

FelixSFD commented 6 years ago

The output of the Google search is currently a dump of on-site and off-site links. It would be nice, if we could print the on-site matches with the SoBoticsPostPrinter, since this class will send the report to CopyPastor

Bhargav-Rao commented 6 years ago

Why not use Reports?

FelixSFD commented 6 years ago

That's a good idea, but it wouldn't send the posts to CopyPastor. Do we want to include results from the checkuser-command in our statistics?

Bhargav-Rao commented 6 years ago

Aren't the >0.80 detections already going through CP? I'm telling about the internet hits. See here for example https://chat.stackoverflow.com/transcript/message/40823852#40823852

FelixSFD commented 6 years ago

Yes, if they were found in the linked/related questions. The on-site results from Google are just added to the dump at the end

Bhargav-Rao commented 6 years ago

Are there any on-site results found from Google which have been more accurate than the Guttenberg report?

jdd-software commented 6 years ago

The optimal solution probably would be a report with links to copypastor probably with lower threshold and only 5 min later filter. The logic probably should be to integrate both related ans google search (as long as we have quota)