Open mlissner opened 8 years ago
I think the aaron schwartz collections are all here: https://archive.org/search.php?query=%22pacer%20dump%22
@ellliottt Do you know anything about the data that you posted the link to? Looks like there's only data there for about 10 jurisdictions?
I know of so many sources of PACER data I need to start writing them down or else I'll forget them. If you know of or have others that we can incorporate, please reply and I'll add it to my checklist here.
[x] RECAP data from Internet Archive
Should be fairly straightforward. Metadata is about as good as anything.
[ ] Hastings briefs scanned by PRO.
Might be very hard. These are image PDFs with hOCR data on top of them. The text could thus be extracted, but we'd still have the original images in the PDFs that we'd want to do something with. The metadata for these is in CSV files, but there seems to be a fair number of items with missing data, e.g., lacking a date.
[ ] PACER Experiments by PRO.
This data contains files named summary.html and docket.html. It's purely metadata, but it'd still be great to get it in place.
[ ] The Aaron Swartz collection in #668
This exists somewhere in the world. Probably in Carl's hands. Worth it to see if it's available.
[x]
All free opinions from researcher that got in touch.This guy disappeared so we're doing it ourselves in #657.[x]
FdSys.No unique data here. Only free opinions, which we're getting in #657.[ ] USPTO PACER data in #664.
This is another source of free opinions/motions. The metadata is ok.