WhichBrowser / Parser-PHP

Browser sniffing gone too far — A useragent parser library for PHP
http://whichbrowser.net
MIT License
1.8k stars 237 forks source link

Add missing regex for search engines and add to separate classes + detect FAKE bots #568

Closed summercms closed 4 years ago

summercms commented 4 years ago

Real Search Engine Bots

Fake Search Engine Bots

Below is example of fake Naver UA:

NaverBot

Real Naver bot uses Yeti see code line: https://github.com/WhichBrowser/Parser-PHP/blob/880b9fa797401d14b28956442944c3daa70240ff/data/applications-bots.php#L164

Link: https://searchadvisor.naver.com/guide/seo-basic-firewall

Add Fake Naver to Fake Bots PR.

Other Fake Bots

acebookexternalhit/1.0 (+http://www.facebook.com/externalhit_uatext.php)

Mozilla/5.0 (Macintosh; Intel Mac OS X 10_11_1) AppleWebKit/601.2.4 (KHTML, like Gecko) Version/9.0.1 Safari/601.2.4 facebookexternalhit/1.1 Facebot Twitterbot/1.0

facebookscraper/1.0( http://www.facebook.com/sharescraper_help.php)

archive.org_bot

AhrefsBot

LinkedInBot

Mozilla/5.0 (compatible; CloudFlare-AlwaysOnline/1.0;+http://www.cloudflare.com/always-online)