freelawproject / juriscraper

An API to scrape American court websites for metadata.
https://free.law/juriscraper/
BSD 2-Clause "Simplified" License
357 stars 106 forks source link

Parsing issue: AssertionError: Line with no boldface: 'Linda R. Anderson, referral' #788

Open sentry-io[bot] opened 10 months ago

sentry-io[bot] commented 10 months ago

Sentry Issue: COURTLISTENER-5PQ

_RemoteTraceback: 
"""
Traceback (most recent call last):
  File "/usr/local/lib/python3.11/concurrent/futures/process.py", line 261, in _process_worker
    r = call_item.fn(*call_item.args, **call_item.kwargs)
        ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
  File "/opt/courtlistener/cl/recap/tasks.py", line 872, in parse_case_query_page_text
    return report.data
           ^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/juriscraper/pacer/case_query.py", line 308, in data
    data = self.metadata.copy()
           ^^^^^^^^^^^^^
  File "/usr/local/lib/python3.11/site-packages/juriscraper/pacer/case_query.py", line 205, in metadata
    raise AssertionError(f"Line with no boldface: '{line}'")
AssertionError: Line with no boldface: 'Linda R. Anderson, referral'
"""
AssertionError: Line with no boldface: 'Linda R. Anderson, referral'
(9 additional frame(s) were not displayed)
...
  File "cl/recap/views.py", line 59, in perform_create
    await asyncio.shield(recap_upload_task)
  File "cl/recap/tasks.py", line 133, in process_recap_upload
    docket = await process_case_query_page(pq.pk)
  File "cl/recap/tasks.py", line 899, in process_case_query_page
    data = await asyncio.get_running_loop().run_in_executor(
sentry-io[bot] commented 8 months ago

This one seems related; only the assertion line is different:

f"Unable to match judge row: {line}"

Sentry issue: COURTLISTENER-2YA

Linked by: @albertisfu