uchicago-capp-30320 / CivicLens

Putting the public back in public commenting
https://civic-lens.org
GNU Affero General Public License v3.0
2 stars 1 forks source link

163 build out qa checks on document data and ideally docket and comment data #164

Closed GregoryHo88 closed 6 months ago

GregoryHo88 commented 6 months ago

Hi @andrewjtdunn , @rezarzky

1.) I was able to write a new method that fetches the comment count by each document open for comment 2.) I have also made some extensions to fetch_documents_by_date_ranges(start_date, end_date). I tried start_date = 2024-01-01 and end_date = 2024-05-12, Was able to paginate through without any problems and fetched ~33k documents, for which 1087 were open for comment. IMO this is a workaround fetching >5k documents. 3.) Using the new method (fetch_comment_count_by_documents()), there were 208 documents with comments.

However, the highest comment count was for "FTC-2024-0018-0001" with 5941 comments.

andrewjtdunn commented 6 months ago

Re #3 -- I think an ideal approach would be to compare the number of comments returned by your method to the number of comments in the comments table. That way we can have some automated QA

GregoryHo88 commented 6 months ago

Roger that, submitted new issue #169 . Will address this asap