freelawproject / courtlistener

A fully-searchable and accessible archive of court data including growing repositories of opinions, oral arguments, judges, judicial financial records, and federal filings.
https://www.courtlistener.com
Other
548 stars 150 forks source link

Improve matching of DocketNumber formats in Search #3860

Open albertisfu opened 8 months ago

albertisfu commented 8 months ago

In elasticsearch, we currently address the scenario where a docket number is indexed as 1:24-bk-12345, allowing it to be matched by a search query like 24-12345 using a proximity query, when we detect that the query includes a docket number.

A user reported an issue where it was not possible to find a docket where its docket_number was indexed in the other format, 24-12345, and the query used was 1:24-bk-12345.

Therefore, we should consider this other scenario to ensure that results can be accurately matched.

mlissner commented 8 months ago

Thanks for filing. This isn't a priority since it has an easy workaround and has been bad for a decade. I'll slot it into your backlog later.