friends-of-presta / byebyebots

An 1.6/1.7 PrestaShop module to remove the logging of every bot connection.
MIT License
6 stars 1 forks source link

Ajout des nouveaux Bots (2024) #1

Closed dabcorp closed 7 months ago

dabcorp commented 7 months ago

Voici le script avec ajouts de bots en date du 10/01/2024 :

<?php


class Connection extends ConnectionCore
{
    /**
     * {@inheritDoc}
     */
    public static function setNewConnection($cookie)
    {
        // Original bot list
        $originalBotList = explode('|', 'BotLink|ahoy|AlkalineBOT|anthill|appie|arale|araneo|AraybOt|ariadne|arks|ATN_Worldwide|Atomz|bbot|Bjaaland|Ukonline|borg-bot/0.9|boxseabot|bspider|calif|christcrawler|CMC/0.01|combine|confuzzledbot|CoolBot|cosmos|Internet Cruiser Robot|cusco|cyberspyder|cydralspider|desertrealm, desert realm|digger|DIIbot|grabber|downloadexpress|DragonBot|dwcp|ecollector|ebiness|elfinbot|esculapio|esther|fastcrawler|FDSE|FELIX IDE|ESI|fido|Hämähäkki|KIT-Fireball|fouineur|Freecrawl|gammaSpider|gazz|gcreep|golem|googlebot|griffon|gulliver|gulper|hambot|havIndex|hotwired|htdig|iajabot|INGRID/0.1|Informant|InfoSpiders|inspectorwww|irobot|Iron33|JBot|jcrawler|Teoma|Jeeves|jobo|image.kapsi.net|KDD-Explorer|ko_yappo_robot|label-grabber|larbin|legs|Linkidator|linkwalker|Lockon|logo_gif_crawler|marvin|mattie|mediafox|MerzScope|NEC-MeshExplorer|MindCrawler|udmsearch|moget|Motor|msnbot|muncher|muninn|MuscatFerret|MwdSearch|sharp-info-agent|WebMechanic|NetScoop|newscan-online|ObjectsSearch|Occam|Orbsearch/1.0|packrat|pageboy|ParaSite|patric|pegasus|perlcrawler|phpdig|piltdownman|Pimptrain|pjspider|PlumtreeWebAccessor|PortalBSpider|psbot|Getterrobo-Plus|Raven|RHCS|RixBot|roadrunner|Robbie|robi|RoboCrawl|robofox|Scooter|Search-AU|searchprocess|Senrigan|Shagseeker|sift|SimBot|Site Valet|skymob|SLCrawler/2.0|slurp|ESI|snooper|solbot|speedy|spider_monkey|SpiderBot/1.0|spiderline|nil|suke|http://www.sygol.com|tach_bw|TechBOT|templeton|titin|topiclink|UdmSearch|urlck|Valkyrie libwww-perl|verticrawl|Victoria|void-bot|Voyager|VWbot_K|crawlpaper|wapspider|WebBandit/1.0|webcatcher|T-H-U-N-D-E-R-S-T-O-N-E|WebMoose|webquest|webreaper|webs|webspider|WebWalker|wget|winona|whowhere|wlm|WOLP|WWWC|none|XGET|Nederland.zoek|AISearchBot|woriobot|NetSeer|Nutch|YandexBot');

        // Extended bot list
        $extendedBotList = explode('|', 'acapbot|acoonbot|alexibot|asterias|attackbot|backdorbot|becomebot|binlar|blackwidow|blekkobot|blexbot|blowfish|bullseye|bunnys|butterfly|careerbot|casper|'
                                       . 'checkpriv|cheesebot|cherrypick|chinaclaw|choppy|clshttp|cmsworld|copernic|copyrightcheck|cosmos|crescent|cy_cho|datacha|demon|diavol|discobot|dittospyder|'
                                       . 'dotbot|dotnetdotcom|dumbot|econtext|emailcollector|emailsiphon|emailwolf|eolasbot|eventures|extract|eyenetie|feedfinder|flaming|flashget|flicky|foobot|fuck|'
                                       . 'g00g1e|getright|gigabot|go-ahead-got|gozilla|grabnet|grafula|harvest|heritrix|httracks?|icarus6j|jetbot|jetcar|jikespider|kmccrew|leechftp|libweb|liebaofast|'
                                       . 'linkscan|linkwalker|loader|lwp-download|majestic|masscan|miner|mechanize|mj12bot|morfeus|moveoverbot|netmechanic|netspider|nicerspro|nikto|ninja|nominet|nutch|'
                                       . 'octopus|pagegrabber|petalbot|planetwork|postrank|proximic|purebot|queryn|queryseeker|radian6|radiation|realdownload|remoteview|rogerbot|scan|scooter|seekerspid|'
                                       . 'semalt|siclab|sindice|sistrix|sitebot|siteexplorer|sitesnagger|skygrid|smartdownload|snoopy|sosospider|spankbot|spbot|sqlmap|stackrambler|stripper|sucker|surftbot|'
                                       . 'sux0r|suzukacz|suzuran|takeout|teleport|telesoft|true_robots|turingos|turnit|vampire|vikspider|voideye|webleacher|webreaper|webstripper|webvac|webviewer|webwhacker|'
                                       . 'winhttp|wwwoffle|woxbot|xaldon|xxxyy|yamanalab|yioopbot|youda|zeus|zmeu|zune|zyborg');

        // Combine both lists, remove duplicates, and create regex pattern
        $combinedBotList = array_unique(array_merge($originalBotList, $extendedBotList));
        $botPattern = '/' . implode('|', $combinedBotList) . '/i';

        if (isset($_SERVER['HTTP_USER_AGENT']) && preg_match($botPattern, $_SERVER['HTTP_USER_AGENT'])) {
            return false;
        }

        return parent::setNewConnection($cookie);
    }
}
mickaelandrieu commented 7 months ago

Pour info, ce module a été mergé dans PrestaShop directement grâce à Jean François Viguier.

dabcorp commented 7 months ago

Ok , merci