[Build] Data refreshment: Correcting our "censored" sample

Kevinlkc / legal_censorship

MIT License

With regard to the scale of censorship, I don't think we're missing anything by an order of magnitude, if we trust the results given by Liebman et al. (2022).

Let's do some back-of-the-envelope calculations. We observe only 1,014,143 civil lawsuits tried in 2013, whereas 1,021,098 were posted for that year when we visited the website in 2021. Likewise, we observe 7,550,158 tried in 2016, whereas 7,628,756 were posted for that year when we visited the website in 2021. That is a gap of roughly 0.7% and 1.0%, respectively, which is still economically significant. The only thing blocking our progress is that we may have labeled the wrong files as "censored".
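For reference, a minimal sketch of the arithmetic above, using only the four counts quoted in this issue. The helper name `missing_share` is hypothetical, and the sketch assumes the gap is measured relative to the number of cases posted on the website.

```python
# Counts quoted in this issue: (observed in our scrape, posted on the website).
counts = {
    2013: (1_014_143, 1_021_098),
    2016: (7_550_158, 7_628_756),
}


def missing_share(observed: int, posted: int) -> float:
    """Fraction of posted cases that are absent from our scrape (hypothetical helper)."""
    return (posted - observed) / posted


# Share of posted cases that we do not observe, by trial year.
shares = {year: missing_share(obs, posted) for year, (obs, posted) in counts.items()}

for year, share in shares.items():
    observed, posted = counts[year]
    print(f"{year}: {posted - observed:,} missing ({share:.2%})")
# 2013: 6,955 missing (0.68%)
# 2016: 78,598 missing (1.03%)
```

Dividing by our observed count instead of the posted count changes the shares only in the third decimal place, so the "about 1%" reading holds either way.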

One more possible to-do item: look at the total number of civil lawsuits that we scraped in 2019 and in 2021, and check whether the magnitudes match both what the paper reports and the website's current count.

[Build] Data refreshment: Correcting our "censored" sample #6