freelawproject / juriscraper

An API to scrape American court websites for metadata.
https://free.law/juriscraper/
BSD 2-Clause "Simplified" License

Implement Louisiana Court of Appeals Circuits: 2nd, 3rd, 4th, 5th #1197

Open grossir opened 1 month ago

grossir commented 1 month ago

We only have a scraper for the First Circuit (lactapp_1). The other circuits also publish opinions. At first glance, their sites do not share a structure, so each would need its own scraper.

From the sample docket numbers, it seems we won't need docket number disambiguation

When we implement these, we should backscrape from July 17th, 2019 to the present. We already have the opinions before that date from other sources. For lactapp_1 there is a gap from July 17th, 2019 to December 29th, 2021.
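
As a rough illustration of the backscrape range described above, the sketch below splits the period starting July 17th, 2019 into fixed-size date windows that a scraper could request one at a time. The function name `date_windows` and the 30-day window size are hypothetical choices for this example, not part of juriscraper; each court's search form may need a different interval.

```python
from datetime import date, timedelta

# Start of the range we still need, per the comment above.
BACKSCRAPE_START = date(2019, 7, 17)


def date_windows(start, end, days=30):
    """Split [start, end] into consecutive, non-overlapping windows.

    Hypothetical helper: the 30-day default is an assumption, not a
    value taken from any of the court sites.
    """
    windows = []
    cursor = start
    while cursor <= end:
        # Each window covers `days` calendar days, clipped to `end`.
        window_end = min(cursor + timedelta(days=days - 1), end)
        windows.append((cursor, window_end))
        cursor = window_end + timedelta(days=1)
    return windows


if __name__ == "__main__":
    for window_start, window_end in date_windows(BACKSCRAPE_START, date.today()):
        print(window_start.isoformat(), window_end.isoformat())
```

The same helper would cover the lactapp_1 gap by passing December 29th, 2021 as the end date.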

| Circuit | URL | Docket sample |
| --- | --- | --- |
| 1st | | 2024KW0648, 2024CW0694 |
| 2nd | https://www.la2nd.org/opinions/ | 55,772-CA |
| 3rd | https://www.la3circuit.org/index.aspx ; click on "Records", then Search Opinions | KA -0023-0582, CA -0023-0017 |
| 4th | https://www.la4th.org/Default.aspx ; click on Search, then Search Opinions | 2023-K-0770, 2023-C-0816, 2023-CA-0664 |
| 5th | https://www.fifthcircuit.org/searchopinions.aspx | 23-KA-473 |

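
One way to sanity-check the observation about docket numbers is to write a pattern per circuit and confirm that each sample matches only its own circuit's pattern. The regular expressions below are inferred solely from the samples in the table above, so they are assumptions and real docket numbers may vary; the names `DOCKET_PATTERNS` and `matching_circuits` are hypothetical.

```python
import re

# Patterns guessed from the sample docket numbers only (assumption).
DOCKET_PATTERNS = {
    "1st": re.compile(r"\d{4}[A-Z]{2}\d{4}"),       # 2024KW0648
    "2nd": re.compile(r"\d{2},\d{3}-[A-Z]{2}"),     # 55,772-CA
    "3rd": re.compile(r"[A-Z]{2} -\d{4}-\d{4}"),    # KA -0023-0582
    "4th": re.compile(r"\d{4}-[A-Z]{1,2}-\d{4}"),   # 2023-K-0770
    "5th": re.compile(r"\d{2}-[A-Z]{2}-\d{3}"),     # 23-KA-473
}

# Samples copied from the table above.
SAMPLES = {
    "1st": ["2024KW0648", "2024CW0694"],
    "2nd": ["55,772-CA"],
    "3rd": ["KA -0023-0582", "CA -0023-0017"],
    "4th": ["2023-K-0770", "2023-C-0816", "2023-CA-0664"],
    "5th": ["23-KA-473"],
}


def matching_circuits(docket):
    """Return every circuit whose pattern matches the whole docket string."""
    return [
        circuit
        for circuit, pattern in DOCKET_PATTERNS.items()
        if pattern.fullmatch(docket)
    ]


if __name__ == "__main__":
    for circuit, dockets in SAMPLES.items():
        for docket in dockets:
            print(circuit, docket, matching_circuits(docket))
```

If every sample maps to exactly one circuit, the formats are at least distinguishable from each other on this small sample; it does not prove anything about docket numbers we have not seen.
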
grossir commented 1 month ago

lactapp_5 is already working. Example