SuffolkLITLab / docassemble-MotionToStayEviction

A Docassemble interview for the Massachusetts Appeals Court Motion to Stay Eviction
https://www.courtformsonline.org/appeals/#stayeviction
MIT License
0 stars 0 forks source link

Figure out OCR performance #55

Closed BryceStevenWilley closed 1 year ago

BryceStevenWilley commented 1 year ago

With docassemble's built in OCRing, it looked like it was taking > 3 hours to OCR a 700 page PDF that I uploaded as an attachment. But when looking closer, it seemed to be consistently processing the pages over and over again.

Need to have some concrete answers on how longs things should take. We could always just turn off things for > 300 pages as an option.

Need to also figure out where to make users wait, since I assume they will want to know that their thing has been filed (I think now we are just silently using the non-OCR'd, which probably isn't a good idea).

BryceStevenWilley commented 1 year ago

Some compromises:

Nothing much else to do here. I'll close this with #61, but in general, we should keep an eye out for other ways to improve OCR and PDF performance.