freelawproject / juriscraper

An API to scrape American court websites for metadata.
https://free.law/juriscraper/
BSD 2-Clause "Simplified" License
363 stars 107 forks source link

Enhancement: Improved Error Logging for scraping in Juriscraper #835

Open flooie opened 9 months ago

flooie commented 9 months ago

Introduction

This issue follows a discussion with @grossir, focusing on enhancing the error logging mechanism in Juriscraper, specifically to allow for better logging when a row or document is skipped.

Problem Statement

Currently, Juriscraper may skip certain rows during operations without providing clear and detailed logging. This lack of transparency can be particularly challenging for new users who are trying to understand and debug the process.

Proposed Enhancement

@grossir suggests enhancing the error logging system to include detailed reasons for why a row might be skipped. This enhancement should Log detailed error messages for each skipped row. We already provide custom error logging for Slow, and Insanity exceptions.

Benefits

Example Use Case

In a scenario where Juriscraper skips a row due to a data mismatch, the enhanced logging system would provide a clear message like: "Row skipped due to data format mismatch in column X."

Potential Challenges

Call to Action

Any feedback or suggestions are encouraged.

mlissner commented 9 months ago

Works for me!