freelawproject / juriscraper

An API to scrape American court websites for metadata.
https://free.law/juriscraper/
BSD 2-Clause "Simplified" License
369 stars 110 forks source link

`ca11` oral argument scraper broken #1178

Closed grossir closed 1 month ago

grossir commented 1 month ago

The problem is <br> tags inside the case name <td> , which makes the scraper return more entries than there are. Easiest solution is to refactor the scraper to OralArgumentSiteLinear

<td headers="view-field-oar-case-name-long-table-column" class="views-field views-field-field-oar-case-name-long">United States v. Jimika Williams, Appellant<br>
<br>
          </td>
sentry-io[bot] commented 1 month ago

Sentry Issue: COURTLISTENER-84T

grossir commented 1 month ago

This is now working, but there is a small gap of 4 oral args for 2024-09-13. #929