arcjet / well-known-bots

List of well-known bots and user-agent patterns to detect them
MIT License
3 stars 0 forks source link

Compare list against isbot #2

Open blaine-arcjet opened 2 months ago

blaine-arcjet commented 2 months ago

We should compare the list of user-agents we match vs isbot to see if we are missing any.

blaine-arcjet commented 1 month ago

I went through the list of RegExps that isbot provides and removed everyone where we overlapped. It is worth noting that isbot used case-insensitive RegExps and we use case-sensitive here.

I've grouped them below.

Generic patterns: I think we'll discard all of these because they are so generic that we wouldn't be able to identify a specific bot.

Not user agents: I don't think these are actually user agents but need to double check.

Little to no overlap: We don't seem to match any of these or they overlap with a different pattern.