freelawproject / courtlistener

A fully-searchable and accessible archive of court data including growing repositories of opinions, oral arguments, judges, judicial financial records, and federal filings.
https://www.courtlistener.com
Other
550 stars 151 forks source link

Update cap crosswalk command #4594

Closed quevon24 closed 1 month ago

quevon24 commented 1 month ago

The command has been updated to improve execution time, now it uses an exact match for filepath_json_harvard field to be able to use the filepath_json_harvard index (I need to check that it exists).

Originally the command used icontains to match cap id to a cluster but the problem is that the index is not being used which slows down the command, more details can be seen in the issue: https://github.com/freelawproject/infrastructure/issues/170

I also removed some duplicate log messages

mlissner commented 1 month ago

LGTM, but I can't merge b/c it's still a draft.

quevon24 commented 1 month ago

LGTM, but I can't merge b/c it's still a draft.

Yes, I noticed that in this command we did not have a parameter to start from a specific reporter, I added it, it is now ready.

sentry-io[bot] commented 1 month ago

Suspect Issues

This pull request was deployed and Sentry observed the following issues:

Did you find this useful? React with a 👍 or 👎