freelawproject / recap

This repository is for filing issues on any RECAP-related effort.
https://free.law/recap/

Docket URLs can have multiple case numbers #221

Open mlissner opened 6 years ago

mlissner commented 6 years ago

Here's an example of such a URL, from the Manafort/Gates cases:

https://ecf.dcd.uscourts.gov/cgi-bin/DktRpt.pl?190597;190598

You can make this by searching for 17-cr-201, then checking the second and third boxes.

Currently, these aren't handled at all; the regex should just skip over them.

As discussed in https://github.com/freelawproject/recap/issues/220, there are a few ideas for solving this:

  • Add a comment noting this uncertain case, maybe a diagnostic log.
  • Add ; to the caseid part of the regexp, i.e. /\?([\d;]+)(?:&.*)?$/ and then split the result on ;
  • Maybe just take the lowest caseid or the first caseid
  • Maybe allow adjacent caseids only
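
To make the second and third options concrete, here is a rough sketch of how they might fit together. The function names `extractCaseIds` and `pickLowestCaseId` are hypothetical and not taken from the RECAP code; only the regex comes from the list above. It assumes case IDs are plain decimal integers.

```javascript
// Hypothetical helpers for illustration; not part of the RECAP codebase.

// Return every case ID found in a docket URL's query string,
// or an empty array when the URL does not match.
function extractCaseIds(url) {
  // The regex suggested above: digits and semicolons right after "?",
  // optionally followed by "&..." parameters.
  const match = /\?([\d;]+)(?:&.*)?$/.exec(url);
  if (!match) {
    return [];
  }
  // Drop empty pieces so a stray ";" does not produce an empty ID.
  return match[1].split(';').filter((id) => id.length > 0);
}

// One possible policy: take the numerically lowest case ID.
// Returns null when there are no IDs to choose from.
function pickLowestCaseId(ids) {
  if (ids.length === 0) {
    return null;
  }
  return String(Math.min(...ids.map(Number)));
}

const ids = extractCaseIds(
  'https://ecf.dcd.uscourts.gov/cgi-bin/DktRpt.pl?190597;190598'
);
const firstId = ids.length > 0 ? ids[0] : null; // "first caseid" policy
const lowestId = pickLowestCaseId(ids);         // "lowest caseid" policy
```

Whether the first or the lowest ID is the right one to keep is still an open question; the sketch only shows that either choice is cheap once the IDs are split out.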

I'm not sure I have a good solution just now, but it's worth thinking about more.