HTTPArchive / cwv-tech-report

Core Web Vitals Technology Report
https://cwvtech.report
28 stars 2 forks source link

Coalesce data for technologies that have multiple names #12

Open rviscomi opened 2 years ago

rviscomi commented 2 years ago

When a technology is renamed in Wappalyzer, we end up with two or more names for the same thing. In the dashboard, this results in two distinct entries in the technologies list that when selected will appear as two separate line segments with different colors rather than one continuous one. One example of this is the recently renamed web-vitals technology. (cc @philipwalton)

Create a process to coalesce the old and new names together under the new name. This will help deduplicate the technologies list and make it easier to track changes over time.

The process may be to keep a map of old to new names in the SQL so that when appending a new month of data or backfilling previous months, the dashboard table will always store the newest name. The underlying technologies tables are snapshots in time of the Wappalyzer output and are probably best left as-is, but an alternate approach may be to overwrite the old tables to update the old names.

Any technologies that are known to have been renamed should be listed in this issue.