freelawproject / juriscraper

An API to scrape American court websites for metadata.
https://free.law/juriscraper/
BSD 2-Clause "Simplified" License
343 stars 98 forks source link

multi-valued PACER referral strings not captured #206

Open johnhawkinson opened 6 years ago

johnhawkinson commented 6 years ago

In everyone's favorite case of the hour:

<td width="60%" valign="top"><br>
Waymo LLC v. Uber Technologies, Inc. et al<br>
Assigned to: Hon. William Alsup<br>
Referred to: Magistrate Judge Jacqueline Scott Corley<br>
&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;&nbsp;Magistrate Judge Elizabeth D. Laporte (Settlement)<br>

this is parsed as

pb3:juriscraper jhawk$  PYTHONPATH=`pwd` juriscraper/pacerdocket.py  juriscraper/pacer/appellate/02.06.03\ CAND-ECF.html 2>/dev/null  |tail -5
  "jury_demand": "Both",
  "nature_of_suit": "830 Patent",
  "parties": [],
  "referred_to_str": "Jacqueline Scott Corley"
}
pb3:juriscraper jhawk$ 

Omitting Judge Laporte.

mlissner commented 6 years ago

Oh no. Cases can have multiple referring judges? That's....cruel. Aside from fixing this in the parser, we'll need to update CourtListener to handle more than one judge.