What exactly is the "exit Transfer Error Rate" in the plot output by tornettools? In the "Once is Never Enough" paper (link), it says it's the fraction of failed downloads over attempted downloads. But I'm not sure what kinds of downloads those are. Are they downloads that never succeeded, downloads that only succeeded after trying different circuits, or something else?
The error rates are computed from tgen client data, specifically the "perf" clients that download 50KB, 1MB, and 5MB files. These clients are added to the simulation so we can track how the network is performing overall. From the client's perspective, it starts a download and the download either succeeds or fails. Partial downloads count as failures. Timeouts count as failures. Whatever happens inside the Tor circuits to cause those failures is orthogonal.
Here is the code that computes the errors if you want the nitty gritty details: https://github.com/shadow/tornettools/blob/9716a8682483f363e4bc9f9503f6871536e7b846/tornettools/parse_tgen.py#L159
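For intuition, here is a minimal sketch of the idea in Python. This is not the actual tornettools code, and the record format is made up; the point is just that every attempted download counts in the denominator, and anything that did not finish counts as a failure.

```python
# Minimal sketch of the error-rate idea; the `downloads` records are
# hypothetical stand-ins for parsed tgen stream results, not the real schema.

def transfer_error_rate(downloads):
    """downloads: list of dicts like {"size": "1mib", "succeeded": bool}."""
    attempted = len(downloads)
    # Any download that did not complete (partial transfer, timeout,
    # stream/circuit error) counts as a failure.
    failed = sum(1 for d in downloads if not d["succeeded"])
    return failed / attempted if attempted else 0.0

# Example: 2 of 4 attempted downloads failed -> error rate of 0.5
example = [
    {"size": "50kib", "succeeded": True},
    {"size": "1mib", "succeeded": False},   # timed out
    {"size": "5mib", "succeeded": False},   # partial download
    {"size": "5mib", "succeeded": True},
]
print(transfer_error_rate(example))  # 0.5
```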
I'm also looking at a plot to compare two simulations. In one simulation, I let some Tor clients run additional programs that try to write sensible values to the v3bw bandwidth files at a certain frequency throughout the simulation. In the other, I added a program that writes random values to the bandwidth files at a certain frequency. In the plot, the two simulations show similar "exit Client Transfer Goodput", but the one with random values shows a higher error rate. Does that mean the one with random values has poorer performance? And is the goodput computed only among downloads that succeeded, or among all downloads?
Yes, your intuition here is correct. The Client Transfer Goodput metric is computed among succeeded 5 MB downloads, following the Tor Project's way of calculating it (from the metrics.torproject.org site). In your "bad" simulation (where you set random weights), many downloads are probably failing, so yes, that means much worse performance in general. But some downloads get lucky and find a fast enough circuit to succeed, and for those we compute the Client Transfer Goodput. This is probably why this metric looks similar across your experiments.
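To make the distinction concrete, here is a hedged sketch of the filtering described above (the field names are illustrative, not the actual tornettools/tgen schema): goodput samples come only from downloads that completed, so a run with many failures can still show a similar goodput distribution for the transfers that got lucky.

```python
# Illustrative only: goodput is sampled from successful 5 MiB downloads,
# while failed attempts show up in the error rate instead.

def goodput_samples(downloads):
    """Return goodput values (Mbit/s) for successful 5 MiB downloads only."""
    return [d["goodput_mbit_per_s"]
            for d in downloads
            if d["succeeded"] and d["size"] == "5mib"]
```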
You are of course welcome to change the tornettools parsing scripts if you'd prefer to compute the metrics differently, or to compute different metrics. These are just the ones we found useful, but who are we to tell you how to do your research!? :)
@robgjansen Thanks, that really cleared it up. One point of curiosity: The "exit Transfer Time" has sizes of 50KiB, 1MiB, and 5MiB, just like in the case of the error rate, as you mentioned. But why does the "exit Client Transfer Goodput" have sizes of "0.5 to 1MiB" and "4 to 5MiB", which don't match the previous ones?
We follow the Tor Project's method of calculating client goodput, which only calculates goodput over the bytes transferred in those sections of the downloads. I think the reasoning is that it can take some time for a download to get "up to full speed", so we should not count the beginning of downloads when computing goodput. Thus, for the purposes of the metric, 50 KB downloads are completely ignored. The "0.5 to 1MiB" range is available on 1 MB downloads and on 5 MB downloads, and the "4 to 5MiB" range is only available on 5 MB downloads.
You can find more information on Tor's client goodput metric via the links in the code comments here: https://github.com/shadow/tornettools/blob/9716a8682483f363e4bc9f9503f6871536e7b846/tornettools/parse_tgen.py#L92-L96
You really should read this one carefully; the default tornettools scripts try to apply a similar methodology:
https://metrics.torproject.org/reproducible-metrics.html#performance
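As a rough illustration of the byte-range idea (a sketch only, assuming you have the elapsed time at which each byte offset was reached; the helper below is hypothetical and not part of tornettools or tgen): goodput is computed only over the later portion of the transfer, so the ramp-up phase at the beginning of the download is excluded.

```python
# Sketch of byte-range goodput: compute throughput only between two byte
# offsets, e.g. from the 4 MiB mark to the 5 MiB mark of a 5 MiB download.
# `t_at` maps a cumulative byte offset to the elapsed seconds when it was
# reached (a hypothetical structure, not the real tgen log format).

def range_goodput_mbps(t_at, start_bytes, end_bytes):
    """Goodput in Mbit/s over the byte range [start_bytes, end_bytes]."""
    elapsed = t_at[end_bytes] - t_at[start_bytes]
    bits = (end_bytes - start_bytes) * 8
    return bits / elapsed / 1e6

MiB = 1024 * 1024
t_at = {4 * MiB: 3.2, 5 * MiB: 3.9}  # example: offsets reached at 3.2s and 3.9s
print(range_goodput_mbps(t_at, 4 * MiB, 5 * MiB))  # ~12 Mbit/s
```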
That explains it, thanks!
I've attached the plot. The sensible simulation is called "orig2crowd", and the one with random values is called "dynamic random (failed)".
tornet.plot.pages.pdf