flexion / ef-cms

An Electronic Filing / Case Management System.
23 stars 10 forks source link

SPIKE: File Size and Type Limitations #591

Closed JessicaMarine closed 5 years ago

JessicaMarine commented 5 years ago

What file size limitations (if any) should the Court impose on external filers? 500MB (#1166) Are there any common work-arounds (i.e., upload as multiple docs)? Post-MVP (#1165) Should the court impose file type limitations? Yes, only PDFs (#1164)

Upload Scenarios:

Issues:

Solution:

Notes/Considerations:

Tasks

waldoj commented 5 years ago

There needs to be some file size limitation. I'm not sure if it's even possible to have no limit for HTTP, but it's certainly undesirable — a bug could use up many gigabytes of storage before it was noticed. A review of over a million court documents as PDFs yielded none larger than 16 MB, with an average scanned page size at 54k. Note, however, that Alabama Middle District (an apparently random example) limits PDFs to 25M, which may be arbitrary, or perhaps it was based on actual experience.

There is no reason why the limitations can't be as large as is necessary.

Separately, there are page limits that may need to be enforced. This can be done on the client side, using pdf.js. Note, however, that this is a soft limit — we can't force their browser to adhere to this limit, so it remains necessary to have a hard limit coded into the server.

Something that we could reasonably do is scan in some documents at a few different settings and see what we get! Maybe print out a sample court filing and scan it in at a few different settings (black and white, grey scale, and color; TIFF or JPEG; ZIP-compressed and not), and also use a PDF-generation iOS app to do the same. The extremes of sizes that we see may be illustrative.

JessicaMarine commented 5 years ago

Some data from other Federal Courts:

image.png

This also provides some helpful information/instructional text: https://www.scd.uscourts.gov/CMECF/Troubleshooting/PageSizeLimits.pdf

JessicaMarine commented 5 years ago

Recommendation from Court's OIS team:

File size max: 500MB (#1166)

Files larger than 500MB should be broken down into multiple files each with a maximum of 500MB. The system should recognize the files (or volumes) as part of the larger document and make them available for viewing through a single docket entry (not one docket entry for each part). POST-MVP (#1165)

Session Timeouts (due to inactivity): (#837) For external users: 30 minutes. Include visual and audio messages as the timeout approaches requiring action on the user’s part in order to continue without being “timed-out”. For internal users: 1 hour. Include visual and audio messages as the timeout approaches requiring action on the user’s part in order to continue without being “timed-out”. Note: Document upload process should not be considered “activity” as far as the EF CMS system is concerned in relation to timeouts

Virus Scanning (#1164) In the existing CMS eFiled documents go through a virus scanning process in addition to an OCR process. Virus scanning should also be part of the new CMS.