freelawproject / juriscraper

An API to scrape American court websites for metadata.
https://free.law/juriscraper/
BSD 2-Clause "Simplified" License
357 stars 106 forks source link

Fill `mass` and `massappct` gaps #984

Open grossir opened 5 months ago

grossir commented 5 months ago

Part of #929

mass

Between September 30th, 2021 and February 6th, 2023 we have 0 documents. According to FindLaw there are 261 Supreme Court opinions

massctapp

Between May 24th, 2021 and February 7th, 2023 we only have 8 opinions. According to FindLaw, there are 562.


This source doesn't list old opinions, one has to go to external providers (recommended by the state source itself) to find them.

One of these sites masscases.com has the reported opinions for the period we are missing, but I am not sure if it is desirable to use a non .gov source.

We could also find a way to re-build the mass.gov file link, which has some structure using the publication date and the docket number. This may be error prone. Examples 1, 2

We should solve this issue before trying freelawproject/courtlistener#3825

grossir commented 5 months ago

After talking about this with Bill, I will implement the backscraper on masscasess.com