freelawproject / juriscraper

An API to scrape American court websites for metadata.
https://free.law/juriscraper/
BSD 2-Clause "Simplified" License
354 stars 105 forks source link

Update PACER parsers to include office code, judge initials, and defendant number #1093

Open mlissner opened 1 month ago

mlissner commented 1 month ago

In freelawproject/courtlistener#4240, we're adding these fields to the DB. Let's start parsing them out of the HTML too.

Our iquery parser is the top priority here, but really we'll want all of our parsers upgraded.

flooie commented 1 month ago

I would like to add one more thing. I think we should include case type as a code

flooie commented 1 month ago

we get the benefit of now being able to re-create the full docket number AND we can sort pacer cases by type - which we really should be able to do.

mlissner commented 1 month ago

You mean the docket letters?

flooie commented 1 month ago

-cv- -cr- -po- -mj- ?

mlissner commented 1 month ago

Yeah, OK.