FrancescoSTL / Site-Sonar

A browser extension which silently crowd-sources ad performance as you browse. Let's put an end to bad ads.
http://site-sonar.com
Mozilla Public License 2.0

Grab More Useful Data #13

Closed FrancescoSTL closed 8 years ago

FrancescoSTL commented 8 years ago

We're aware that asset load time, ad host URL, and origin URL currently don't give us much to report on. The challenge is gathering information that is both useful and privacy-respecting: we don't want logs that allow someone to be uniquely identified.

Some thoughts for useful metrics we can gather:

  1. Unique page-visit ID for each group of assets in one page visit. This will allow us to determine page performance by host to some degree of accuracy. One variable which may throw off our data here is the amount of time spent on a page. If one user lets only 1 of a potential 300 assets load, while 2 other users load all 300, that throws off our average considerably, which is why it would be useful to grab the next data point.
  2. Time spent on page. If we collect time spent on a page when grouping requests by page visit, we will be able to filter out page visits which were too short to capture a majority of the requests on that page.
  3. (DONE) Ad network. This determination can also happen server-side, so I'm unsure if we should be doing it in the extension. That said, we've already got the list handy in the extension.
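
To make the first two ideas concrete, here is a rough sketch of how a random page-visit ID and a time-on-page cutoff could work together when averaging. It is only an illustration: the record shape, the function names, and the 5-second cutoff are all assumptions, not anything from the extension or server code.

```python
import uuid
from collections import defaultdict
from statistics import mean
from typing import NamedTuple


class AssetRecord(NamedTuple):
    # Hypothetical record shape; field names are illustrative only.
    visit_id: str          # random ID shared by all assets of one page visit
    origin: str            # site the visit belongs to
    load_time_ms: float    # load time of this asset
    time_on_page_s: float  # how long the visit lasted


def new_visit_id() -> str:
    """Random per-visit ID; it carries no user or device information."""
    return uuid.uuid4().hex


def mean_visit_load_by_origin(records, min_time_on_page_s=5.0):
    """Average total asset load time per visit, grouped by origin.

    Visits shorter than min_time_on_page_s (an assumed cutoff) are skipped.
    """
    totals = defaultdict(float)   # visit_id -> summed load time
    meta = {}                     # visit_id -> (origin, time_on_page_s)
    for r in records:
        totals[r.visit_id] += r.load_time_ms
        meta[r.visit_id] = (r.origin, r.time_on_page_s)

    by_origin = defaultdict(list)
    for visit_id, total in totals.items():
        origin, time_on_page_s = meta[visit_id]
        if time_on_page_s >= min_time_on_page_s:
            by_origin[origin].append(total)
    return {origin: mean(vals) for origin, vals in by_origin.items()}
```

Because the ID is generated fresh for every page visit, it groups assets without linking separate visits by the same person.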

More TBD

FrancescoSTL commented 8 years ago

Now that #15 is merged, we are grabbing the following:

  1. asset size
  2. asset ad network
  3. asset type (script, image, iframe, etc)
  4. a timestamp of when the asset was requested
  5. method (GET or POST)
  6. status code (200, 301, etc)
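
For anyone picturing what one logged asset might look like, here is a minimal sketch of a record holding those six fields. The class name, field names, units, and sample values are all made up for illustration and may not match the actual payload.

```python
from dataclasses import dataclass, asdict


@dataclass(frozen=True)
class AssetLog:
    # All field names are hypothetical; they mirror the six items listed above.
    size_bytes: int        # 1. asset size
    ad_network: str        # 2. ad network the asset was matched to
    asset_type: str        # 3. script, image, iframe, etc.
    requested_at_ms: int   # 4. timestamp of the request (epoch milliseconds assumed)
    method: str            # 5. GET or POST
    status_code: int       # 6. 200, 301, etc.


example = AssetLog(
    size_bytes=48213,
    ad_network="ExampleAds",
    asset_type="script",
    requested_at_ms=1467331200000,
    method="GET",
    status_code=200,
)
payload = asdict(example)  # plain dict, ready to serialize as JSON
```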

I'm going to leave this open to track as we continue to discuss what sort of data we want to, and can, collect.

FrancescoSTL commented 8 years ago

We've decided to aggregate the above data and show the following on our dashboard:

  1. Network by response time (avg and distribution)
  2. Network by file size (avg and distribution)
  3. Website by response time (avg and distribution) for top n Alexa sites
  4. Website by file size (avg and distribution) for top n Alexa sites
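
As a rough sketch of what "avg and distribution" could mean in practice, the snippet below computes a mean plus nearest-rank percentiles per group. The function names, the choice of p50 and p90, and the sample shape are assumptions for illustration, not the dashboard's actual implementation.

```python
import math
from collections import defaultdict
from statistics import mean


def percentile(sorted_values, p):
    """Nearest-rank percentile of a sorted, non-empty list (0 < p <= 100)."""
    rank = math.ceil(p / 100 * len(sorted_values))
    return sorted_values[rank - 1]


def summarize(samples):
    """Group (key, value) pairs by key and report avg, p50, and p90.

    The key can be an ad network or a website; the value can be a
    response time or a file size.
    """
    grouped = defaultdict(list)
    for key, value in samples:
        grouped[key].append(value)

    summary = {}
    for key, values in grouped.items():
        values.sort()
        summary[key] = {
            "avg": mean(values),
            "p50": percentile(values, 50),
            "p90": percentile(values, 90),
        }
    return summary
```

The same function covers all four dashboard views by changing what is passed as the key and the value.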

I'll keep this milestone open but remove the v1.0 tag, since we've determined where we want to go for v1 and that portion is already completed.

FrancescoSTL commented 8 years ago

Done with the above. We'll be open for suggestions when the time comes, but I'll close this for now.