freelawproject / juriscraper

An API to scrape American court websites for metadata.
https://free.law/juriscraper/
BSD 2-Clause "Simplified" License
357 stars 106 forks source link

fix(nh_p): handle edge cases #1153

Closed grossir closed 3 weeks ago

grossir commented 4 weeks ago

Related to #1129

Added 2 edge cases to the example files. Related to getting the document URL and the case date

flooie commented 4 weeks ago

I tested out the scraper for 2022 - and I got some edge cases from it.

Adding new item:
    case_dates: 2022-09-30
    case_names: "Appeal of Javier Vasquez and 2021-0072, Appeal of Matosantos International Corporation"
    download_urls: "https://www.courts.nh.gov/sites/g/files/ehbemt471/files/documents/2022-09/2022061vasquezandmatosantos.pdf"
    precedential_statuses: "Published"
...
    docket_numbers: "2021-0071"

 case_dates: 2022-01-05
    case_names: "& 2020-0313, In the Matter of James R. Britton and Patricia F. Britton"
    download_urls: "https://www.courts.nh.gov/sites/g/files/ehbemt471/files/documents/2022-01/2022001Britton.pdf"
    precedential_statuses: "Published"
    blocked_statuses: False
    date_filed_is_approximate: False
    docket_numbers: "2020-0029"
grossir commented 3 weeks ago

@flooie I updated the PR please check again