Problems downloading docs in chrome, with recap enabled...

Summary: Get a spinning beach ball in chrome on mac when trying to download docs with recap enabled.

Details: While it appears that recap can successfully upload the docket (when I view it in pacer, using chrome with the recap extension), the last several times (over a couple weeks, I'm an occasional user) that I have tried to download docs from the dockets I am interested in (CA bankruptcy court) the downloads to NOT complete (spinning beach ball).

Example docket I'm trying to download from: https://www.courtlistener.com/docket/67784556/the-litigation-practice-group-pc-adversary-proceeding/ https://ecf.cacb.uscourts.gov/cgi-bin/DktRpt.pl?759118030653799-L_1_0-1

Note: the one I'm reporting this bug on is a 6 part document (I wonder if this is related to multi-part documents)...

The "solution" for me has been to disable the recap extension, then the downloads work as expected.

When I open the developer tools and look at the console messages I see the following: RECAP: Successfully submitted zip file request Uncaught (in promise) TypeError: Cannot read properties of undefined (reading 'src') at extractUrl (content_delegate.js:548:22) at ContentDelegate.onDownloadAllSubmit (content_delegate.js:592:18)

So, somewhere in here:

// fetch the html page which contains the link to the zip document. const htmlPage = await browserSpecificFetch(event.data.id).then((res) => res.text()); console.log('RECAP: Successfully submitted zip file request'); const zipUrl = extractUrl(htmlPage); //download zip file and save it to chrome storage const blob = await fetch(zipUrl).then((res) => res.blob()); const dataUrl = await blobToDataURL(blob); console.info('RECAP: Downloaded zip file'); // save blob in storage under tabId // we store it as an array to chunk the message await updateTabStorage({ [this.tabId]: { ['zip_blob']: dataUrl }, });</p> <p>...</p> <p>// TODO: Confirm that zip downloading is consistent across jurisdictions ContentDelegate.prototype.onDownloadAllSubmit = async function (event) { // helper function - extract the zip by creating html and querying the frame const extractUrl = (html) => { const page = document.createElement('html'); page.innerHTML = html; const frames = page.querySelectorAll('iframe'); return frames[0].src; };</p> </div> </div> <div class="comment"> <div class="user"> <a rel="noreferrer nofollow" target="_blank" href="https://github.com/mlissner"><img src="https://avatars.githubusercontent.com/u/236970?v=4" />mlissner</a> commented <strong> 9 months ago</strong> </div> <div class="markdown-body"> <p>Thanks for the useful bug report. @ERosendo can you please take a look at this when you have a moment away from the ACMS stuff and the Elastic stuff? </p> <p>@el-abcd, you might be able to just disable uploading from within RECAP's settings. That puts it in "mooch mode" which I think would at least let you get content for free until this is fixed.</p> </div> </div> <div class="comment"> <div class="user"> <a rel="noreferrer nofollow" target="_blank" href="https://github.com/el-abcd"><img src="https://avatars.githubusercontent.com/u/39447831?v=4" />el-abcd</a> commented <strong> 9 months ago</strong> </div> <div class="markdown-body"> <p>Thx.</p> <p>I did try downloading the six docs above "one at a time" and that did work (clunky, end up opening multiple tabs, etc.). But I think that might help confirm this is related to "multi-part docs". I'll try disabling uploads when I do more "multi-part" downloads. </p> <p>The successfully uploaded docs (that I fetched one at a time) do show up here: <a href="https://www.courtlistener.com/docket/67784556/the-litigation-practice-group-pc-adversary-proceeding/">https://www.courtlistener.com/docket/67784556/the-litigation-practice-group-pc-adversary-proceeding/</a></p> </div> </div> <div class="comment"> <div class="user"> <a rel="noreferrer nofollow" target="_blank" href="https://github.com/ERosendo"><img src="https://avatars.githubusercontent.com/u/55959657?v=4" />ERosendo</a> commented <strong> 7 months ago</strong> </div> <div class="markdown-body"> <p>@mlissner Upon examination, the issue seems to be related to the <code>extractUrl</code> helper function. This function attempts to extract the zip URL from the response generated after the form is submitted. However, it employs a query to an <code>iframe</code> within the response, which fails in the absence of this HTML element (as encountered in docket <a href="https://www.courtlistener.com/docket/67784556/the-litigation-practice-group-pc-adversary-proceeding/">8:23-ap-01098</a>).</p> <p>I inspected the response received after the form submission and noticed it contained only a small number of HTML tags. The following is the body of the HTML page we get:</p> <pre><code class="language-html"><div id="cmecfMainContent"> <br>&nbsp;Your download will begin in a separate window. <div id="scroller" style="height: 100%; width: 100%; overflow: scroll; -webkit-text-size-adjust:none; resize: both;"> <script language="javascript" type="text/javascript"> window.location = "/cgi-bin/show_temp.pl?file=zipped_0.352834029679315.zip&type=application/zip&filename_prompt=8-23-ap-01098-SC.zip"; </script> </div></code></pre> <p>I was able to download the zip file by using the URL embedded within the <code>script</code> tag. This suggests that we could fix this issue by tweaking the <code>extractUrl</code> to get the URL from the <code>script</code> tag</p> </div> </div> <div class="comment"> <div class="user"> <a rel="noreferrer nofollow" target="_blank" href="https://github.com/el-abcd"><img src="https://avatars.githubusercontent.com/u/39447831?v=4" />el-abcd</a> commented <strong> 7 months ago</strong> </div> <div class="markdown-body"> <p>FWIW, this LGTM!</p> <p>I forked the repo, git clone'd it locally, and followed the instructions on how to install locally for chrome (tweak versions in package.json and manifest.json, make the zip, and "load unpacked" in chrome. </p> <p>I tested by downloading and uploading this multi-doc zip file: <a href="https://www.courtlistener.com/docket/67051161/the-litigation-practice-group-pc/?filed_after=&filed_before=&entry_gte=&entry_lte=&order_by=desc#entry-511">https://www.courtlistener.com/docket/67051161/the-litigation-practice-group-pc/?filed_after=&filed_before=&entry_gte=&entry_lte=&order_by=desc#entry-511</a></p> <p>and it worked as expected (including the message mentioning a download was in progress). </p> <p>Thanks! Eric</p> </div> </div> <div class="comment"> <div class="user"> <a rel="noreferrer nofollow" target="_blank" href="https://github.com/mlissner"><img src="https://avatars.githubusercontent.com/u/236970?v=4" />mlissner</a> commented <strong> 7 months ago</strong> </div> <div class="markdown-body"> <p>Thanks again for the helpful bug report.</p> </div> </div> <div class="page-bar-simple"> </div> <div class="footer"> <ul class="body"> <li>© <script> document.write(new Date().getFullYear()) </script> Githubissues.</li> <li>Githubissues is a development platform for aggregating issues.</li> </ul> </div> <script src="https://cdn.jsdelivr.net/npm/jquery@3.5.1/dist/jquery.min.js"></script> <script src="/githubissues/assets/js.js"></script> <script src="/githubissues/assets/markdown.js"></script> <script src="https://cdn.jsdelivr.net/gh/highlightjs/cdn-release@11.4.0/build/highlight.min.js"></script> <script src="https://cdn.jsdelivr.net/gh/highlightjs/cdn-release@11.4.0/build/languages/go.min.js"></script> <script> hljs.highlightAll(); </script> </body> </html>

freelawproject / recap

Problems downloading docs in chrome, with recap enabled... #352