opawg / user-agents-v2

Comprehensive open-source collection of broadly-compatible regular expression patterns to identify and analyze podcast player user agents.
MIT License
44 stars 24 forks source link

user-agents-v2

Comprehensive open-source collection of broadly-compatible regular expression patterns to identify and analyze podcast player user agents.

Quick start

Given a HTTP User-Agent found in your podcast episode server logs, to find a deterministic entity match:

(Optional) If type is not bot, to additionally break down by device:

(Optional) If type is browser and you also have the HTTP Referer header in your logs, to additionally break down by known web apps:

Approach

This collection is an evolution of the original OPAWG User agent list, refactored in some ways and overlaid with ideas from the excellent Buzzsprout Podcast Agents Ruby gem.

Some of the goals of this collection:

Evolution

These patterns were initially created with a one-time automated transform of the original OPAWG User agent list, with the following transformations: