open-austin / indigent-defense-stats

A web scraper for collecting and processing public case records from sites using Tyler Technologies' Odyssey court records database software.
MIT License

Update the Texas County Data CSV #66

Open newswim opened 5 months ago

newswim commented 5 months ago

This file was last updated 2 years ago.

Scope

Go through the counties and validate that:

  1. The portal URLs are still correct
  2. The version of Odyssey is still correct
  3. Other metadata is also correct

Link: https://github.com/open-austin/azure-indigent-defense/blob/main/resources/texas_county_data.csv

tpadmanabhan commented 5 months ago

Review with Nick and Nidhi on 3/31

tpadmanabhan commented 5 months ago

Assigned to Nick. Target due date 4/13/24

tpadmanabhan commented 5 months ago

What is the scope of counties here? @lianilychee

tpadmanabhan commented 4 months ago

@nicolassaw Nick, please prioritize counties based on what Fair Defense wants (Nate)

nicolassaw commented 4 months ago

I'm having a tough time finding the API endpoints for some of these Odyssey search engines. Does anybody have resources on how to tell whether a given online search function has an open API? I tried the Chrome DevTools approach (Inspect > Network > XHR), but that only captures GET requests made by the page that is currently loaded, and sometimes the links that make the requests open in a new window. Any ideas?

nicolassaw commented 4 months ago

I submitted a new version of the dataset that has the following changes:

(1) Corrected URLs for some search pages.
(2) Added two new fields per county, described below:
  -- attys_high_caseloads_count_fy22: The number of attorneys who disposed of more appointed cases within the year than an approximation of a national caseload standard would allow.
  -- avg_atty_caseload_fy22: The average number of appointed cases that an attorney in the jurisdiction reported closing within the fiscal year.
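For anyone who wants to reproduce the two added fields, here is a rough sketch of how they could be derived from per-attorney counts. It is not the analysis that was actually run: the input shape (one `(county, attorney_id, cases_disposed)` tuple per attorney per county) is an assumption, `summarize_caseloads` is a hypothetical helper, and `threshold` is left as a required argument because the thread does not state which approximation of the national standard was used.

```python
from collections import defaultdict
from statistics import mean


def summarize_caseloads(records, threshold):
    """Compute the two per-county fields from per-attorney case counts.

    records: iterable of (county, attorney_id, cases_disposed), assumed to
             contain one row per attorney per county.
    threshold: caseload limit to compare against; the value is an assumption
               the caller must supply.
    """
    per_county = defaultdict(list)
    for county, _attorney_id, cases in records:
        per_county[county].append(cases)

    return {
        county: {
            # Attorneys whose disposed appointed cases exceed the threshold.
            "attys_high_caseloads_count_fy22": sum(1 for c in counts if c > threshold),
            # Mean appointed cases disposed per attorney in the county.
            "avg_atty_caseload_fy22": mean(counts),
        }
        for county, counts in per_county.items()
    }
```

A single flat threshold is a simplification; a standard that weights felonies and misdemeanors differently would need case counts broken out by case type.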

Possible Next Steps: (1) Review the remaining links and correct any that are broken or point to the wrong search page. Focus on County and District court links only. (2) Consider whether there are other weighting factors worth adding as fields in the sheet.

Note: As I reviewed the URLs, I noticed that some of the links provided for a county point to search engines for Justice of the Peace courts. Those courts preside only over Class C misdemeanors, and you don't have a right to counsel in those cases. The link provided should instead go to the search page for the adult misdemeanor and felony courts (also known as County and District Courts, respectively), because these are the courts in which you have a right to counsel.

nicolassaw commented 4 months ago

Should I upload the source data for the fields I added to the table?

The data came from a public report on this page (https://tidc.tamu.edu/public.net/Reports/AttorneyCaseLoad.aspx), and I did some analysis on it.

newswim commented 4 months ago

Thanks for working on this, Nick! I saw your PR and added the two versions to this spreadsheet for spot checking. We're not actually using the .csv for anything right now; it's more of a reference for devs to use as we start adding more counties to the corpus.

I think it's totally fine to add any metadata that you think would be helpful.

Here's that spreadsheet: https://docs.google.com/spreadsheets/d/1YBuU4hKufhzA-K-ZXbJ42u04t5X82PBoAazSK8RcVoM/edit#gid=0