freelawproject / courtlistener

A fully-searchable and accessible archive of court data including growing repositories of opinions, oral arguments, judges, judicial financial records, and federal filings.
https://www.courtlistener.com

Integrate Courts-db into Courtlistener #1230

Open flooie opened 4 years ago

flooie commented 4 years ago

We have a brand new project, courts-db, that we should start using to identify courts in CL.

quevon24 commented 1 year ago

@mlissner @flooie I've inspected the CourtListener source code to find where we can use courts-db, but I couldn't find a place where we could implement it or replace something with it. Do you have any idea where we should use the library?

flooie commented 1 year ago

Technically, we already started using it ...

cl/corpus_importer/management/commands/harvard_opinions.py

mlissner commented 1 year ago

I think the idea is to remove this file: https://github.com/freelawproject/courtlistener/blob/4ca2d63f9432603a6e33555bf03677580dd596f7/cl/corpus_importer/court_regexes.py https://github.com/troglodite2/courtlistener/blob/4ca2d63f9432603a6e33555bf03677580dd596f7/cl/corpus_importer/court_regexes.py

I haven't looked at how it's used. Want to report back on that, and then we can decide if we want to replace functionality or just remove it?
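One way to start that report is to list every module that imports `court_regexes`. The following is a minimal sketch, assuming a local checkout of the repository; the helper name `find_importers` is hypothetical and not part of CourtListener or courts-db.

```python
import ast
from pathlib import Path


def find_importers(root: str, module_name: str = "court_regexes") -> dict[str, list[int]]:
    """Map each .py file under `root` to the line numbers where it imports `module_name`.

    Hypothetical helper for auditing usage; assumes Python 3.9+.
    """
    hits: dict[str, list[int]] = {}
    for path in sorted(Path(root).rglob("*.py")):
        try:
            tree = ast.parse(path.read_text(encoding="utf-8"))
        except (SyntaxError, UnicodeDecodeError):
            continue  # skip files that cannot be read or parsed

        lines = []
        for node in ast.walk(tree):
            if isinstance(node, ast.Import):
                # e.g. `import cl.corpus_importer.court_regexes`
                names = [alias.name for alias in node.names]
            elif isinstance(node, ast.ImportFrom):
                # e.g. `from cl.corpus_importer.court_regexes import X`
                # or   `from cl.corpus_importer import court_regexes`
                module = node.module or ""
                names = [module] + [f"{module}.{alias.name}" for alias in node.names]
            else:
                continue
            # Match when any dotted component equals the module name.
            if any(module_name in name.split(".") for name in names):
                lines.append(node.lineno)

        if lines:
            hits[str(path)] = sorted(lines)
    return hits


if __name__ == "__main__":
    # Assumes the script is run from the repository root.
    for filename, line_numbers in find_importers("cl").items():
        print(filename, line_numbers)
```

This only finds static imports. Dynamic imports or copies of the regexes pasted into other files would need a plain text search as well.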

quevon24 commented 1 year ago

> Technically, we already started using it ...
>
> cl/corpus_importer/management/commands/harvard_opinions.py

Yes. Besides the Harvard opinions importer, is there anywhere else we should use the library? I have also used it in the Lexis and Westlaw citation importer.

flooie commented 1 year ago

@quevon24 I think @mlissner pointed out that we should see if we can remove the giant regexes in the corpus importer.

mlissner commented 1 week ago

@flooie, can we analyze this and maybe do it? Seems like an easy win.