ooni / probe

OONI Probe network measurement tool for detecting internet censorship
https://ooni.org/install
BSD 3-Clause "New" or "Revised" License

data quality: investigate a bunch of measurements that might be false positives #1891

Open bassosimone opened 2 years ago

bassosimone commented 2 years ago

This issue collects a bunch of suspicious URLs. We should look into them and figure out why they are failing. We'll consider this issue closed when we've looked at them and documented what to do to address the underlying problem (I guess in websteps, since at this point we should not modify Web Connectivity significantly).

We've recently received a bunch of requests to check URLs that might be false positives. I am not working today, but I would like to record those URLs before I forget, so I can take a look next week.

I marked this issue as "needs investigation and triage" because we may need to split these URLs and assign them to more specific issues. The first task is to classify and triage what's happening.

agrabeli commented 2 years ago

Only a few anomalies are present in the dataset for this tested URL: https://explorer.ooni.org/measurement/20211120T232208Z_webconnectivity_IT_30722_n1_apE8ioEoVOjxf847?input=http%3A%2F%2Fwww.xenu.net%2F

See MAT: https://explorer.ooni.org/experimental/mat?probe_cc=IT&probe_asn=AS30722&test_name=web_connectivity&input=http%3A%2F%2Fwww.xenu.net%2F&since=2021-10-01&until=2022-02-08&axis_x=measurement_start_day
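
For triaging the other suspicious URLs the same way, here is a minimal sketch that rebuilds the MAT link above from its query parameters. The parameter names (`probe_cc`, `probe_asn`, `test_name`, `input`, `since`, `until`, `axis_x`) are copied from that link; the `mat_url` helper itself is hypothetical and not part of any OONI tool.

```python
from urllib.parse import urlencode

# Base path taken from the MAT link in this comment.
MAT_BASE = "https://explorer.ooni.org/experimental/mat"


def mat_url(probe_cc, probe_asn, test_name, input_url, since, until,
            axis_x="measurement_start_day"):
    """Build a MAT link for one input URL (hypothetical helper)."""
    params = {
        "probe_cc": probe_cc,
        "probe_asn": probe_asn,
        "test_name": test_name,
        "input": input_url,   # percent-encoded by urlencode
        "since": since,
        "until": until,
        "axis_x": axis_x,
    }
    return f"{MAT_BASE}?{urlencode(params)}"


# Reproduces the link above for http://www.xenu.net/ on AS30722 in Italy.
print(mat_url("IT", "AS30722", "web_connectivity",
              "http://www.xenu.net/", "2021-10-01", "2022-02-08"))
```

Swapping in a different `input_url`, country code, or ASN should give the equivalent chart for another URL from this issue, assuming MAT keeps accepting the same parameters.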

agrabeli commented 2 years ago

de.lirio.us has been removed from the global test list: https://github.com/citizenlab/test-lists/pull/832