HTTPArchive / legacy.httparchive.org

<<THIS REPOSITORY IS DEPRECATED>> The HTTP Archive provides information about website performance such as # of HTTP requests, use of gzip, and amount of JavaScript. This information is recorded over time revealing trends in how the Internet is performing. Built using Open Source software, the code and data are available to everyone allowing researchers large and small to work from a common base.
https://legacy.httparchive.org
Other
328 stars 84 forks source link

Security 2021 custom metrics #219

Closed GJFR closed 3 years ago

GJFR commented 3 years ago

Progress on https://github.com/HTTPArchive/almanac.httparchive.org/issues/2150.

Also includes renaming and extending ecommerce custom metric for well-known URLs as per https://github.com/HTTPArchive/almanac.httparchive.org/issues/2211.

rviscomi commented 3 years ago

Is this ready for review?

max-ostapenko commented 3 years ago

@GJFR Could you please add, as ,emtioned in #211

// privacy
parseResponse('/.well-known/gpc.json', r => {
  return r.text().then(text => {
    let data = {
      'gpc': null
    };
    let gpc_data = JSON.parse(text);
    if (typeof gpc_data.gpc == 'boolean') {
      data.gpc = gpc_data.gpc;
    }
    return data;
  });
}),
GJFR commented 3 years ago

@max-ostapenko Your code has been added 👍

@rviscomi Thank you for your comments! I've updated the code. I'm gonna do a quick check and tweak the robots.txt data collecting. Will mark as ready for review ASAP.

GJFR commented 3 years ago

WPT test runs:

SaptakS commented 3 years ago

WPT tests look good to me.

GJFR commented 3 years ago

I filtered on all keywords discussed in this thread.

A few thoughts:

WPT test runs:

rviscomi commented 3 years ago

LGTM thanks everyone!