hypothesis / product-backlog

Where new feature ideas and current bugs for the Hypothesis product live
118 stars 7 forks source link

URLs that do not work correctly in Via or the LMS app #1270

Open mkdir-washington-edu opened 2 years ago

mkdir-washington-edu commented 2 years ago

Bug report form

See: https://hypothes-is.slack.com/archives/C2BLQDKHA/p1642175183152600?thread_ts=1641839979.048300&cid=C2BLQDKHA

tl;dr We see Via users (especially instructors using the LMS app) not always anticipating the issues they may get on URLs they want to annotate. Since this is largely about LMS users the chrome extension and bookmarklet are not viable options.

This issue will be to track "problem URLs"; the reasons might be log ins, our proxy being blocked, content within iframes, and other reasons.

Support folks adding URLs to this form, please list out:

Support can help keep this list useful by, of course, vetting these sites and make sure the user is reasonable in what they're linking to. Reasonable would be a site where a user can see the text they would expect students to annotate.

Template for comments: URL:

Support ticket:

Via behavior:

(optional) Slack convo:

mkdir-washington-edu commented 2 years ago

URL: https://www.thelancet.com/journals/lancet/article/PIIS0140-6736(21)02870-1/fulltext

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/732776255/

Via behavior: Reported by the user:

Many students and he himself are receiving the error “viahtml3.hypothesis.is redirected you too many times.” Sometimes, the page the instructor is trying to use hypothesis on won't load, even though the hypothesis add on will. Finally, the instructor occasionally gets the following Error: "Via Error None Error Details:Internal Error: 'wsgi.url_scheme'"

mkdir-washington-edu commented 2 years ago

URL: https://design-justice.pubpub.org/pub/ap8rgw5e/release/1

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/733849931/

Via behavior: After creating an annotation the page gives the error:

An error has occured in the editor. We've logged the error and will look into the cause right away. All previous changes have been successfully saved. To continue editing, please refresh the page.

Related: https://github.com/hypothesis/support/issues/210

mkdir-washington-edu commented 2 years ago

URL: https://www.independent.co.uk/life-style/aren-t-you-happy-me-1621301.html

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/667004825/

Via behavior: The site automatically forwards to https://www.independent.co.uk/life-style/aren-t-you-happy-for-me-1621301.html, but annotations created before the forwarding and new URL were set up are associated with the first URL, so they are no longer accessible in the context of the LMS app.

mkdir-washington-edu commented 2 years ago

URL: https://www.nydailynews.com/opinion/ny-oped-teach-public-school-kids-the-classics-20220104-kuynjw2alvdnvlatmv3tnvfjie-story.html

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/725595995/

Via behavior: "page unresponsive" - site often fails to load.

mkdir-washington-edu commented 2 years ago

URL: not specific, not New York Times websites

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/728305781/

Via behavior: User reports:

New York Times paywall did not used to cause problems, but even when I am logged into my NYT account, I now cannot get access to any articles via Hypothesis in Canvas. When I am invited to login, it seems that I can enter my nyt credentials, but then the captcha fails.

mkdir-washington-edu commented 2 years ago

URL: http://rhetoric.byu.edu/

Support ticket: none

Via behavior: Adder doesn't appear when selecting text

Slack convo: https://hypothes-is.slack.com/archives/C2BLQDKHA/p1641947154097300

mkdir-washington-edu commented 2 years ago

URL: https://via.hypothes.is/https://journal.transformativeworks.org/index.php/twc/article/view/518/428

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/732623429/

Via behavior: Adder doesn't appear when selecting text

Slack convo: https://hypothes-is.slack.com/archives/C2BLQDKHA/p1641839979048300

mattdricker commented 2 years ago

URL: https://via.hypothes.is/https://cvc.cervantes.es/

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/477078696/

Via behavior:

UpstreamServiceError: (‘Connection aborted.’, ConnectionResetError(104, ‘Connection reset by peer’)) URL: https://via.hypothes.is/https://cvc.cervantes.es Third party URL: None

125513356-a3db8224-e246-407d-92bd-266ed470d39d

Related: https://github.com/hypothesis/support/issues/215#issue-943763801

mattdricker commented 2 years ago

URL: https://www.dhm.de/lemo/kapitel/weimarer-republik/alltagsleben.html

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/757452373/

Via behavior:

UpstreamServiceError: HTTPSConnectionPool(host='www.dhm.de', port=443): Max retries exceeded with url: /lemo/kapitel/weimarer-republik/alltagsleben.html (Caused by SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1131)')))

URL: https://via.hypothes.is/https://www.dhm.de/lemo/kapitel/weimarer-republik/alltagsleben.html

Third party URL: https://www.dhm.de/lemo/kapitel/weimarer-republik/alltagsleben.html

Screen Shot 2022-01-28 at 4 11 35 PM
mkdir-washington-edu commented 2 years ago

Ignore this. See Rob's comment below.

URL: https://musicbrainz.org/

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/778913126/

Via behavior: Page font gets larger. Happens with Bookmarklet as well.

robertknight commented 2 years ago

Via behavior: Page font gets larger. Happens with Bookmarklet as well.

Can you pull this one out into a separate issue. It looks like this relates to some PDF.js-specific changes that happen to be loaded in normal web pages as well. Two things we need to do here:

  1. Not load the pdfjs-overrides.css file outside of PDF.js (see https://github.com/hypothesis/client/pull/4224)
  2. Continue looking for solutions to the adder REM-sizing issue that don't affect the host page.

CC @lyzadanger (this can wait until you are back)

mkdir-washington-edu commented 2 years ago

@robertknight Thank you! https://github.com/hypothesis/product-backlog/issues/1317

mkdir-washington-edu commented 2 years ago

URL: https://opentext.uleth.ca/apex-video/sec_disk.html

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/781988657

Via behavior: embedded javascript in the page doesn't work in Via. From the user:

The problem seems to be that the app is interfering with the Asymptote renderer itself, as the same problem appears if just the asymptote html image document (e.g. https://www2.hawaii.edu/~gradysw/apex/images/img_cross1.html ) is placed inside the LMS app.

mattdricker commented 2 years ago

URL: https://quod.lib.umich.edu/f/frag/9772151.0006.001/--black-death-bodies?rgn=main;view=fulltext

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/787255475/

Via behavior: Variable when creating LMS assignment. Most often the page will stop loading with a Chrome error viahtml3.hypothes.is has redirected you too many times

Screen Shot 2022-02-21 at 9 33 19 AM

Have also witnessed the page loading fully but with the public Hypothesis web app loaded, rather than LMS app.

User reports seeing the error Via Error None Error Details: Internal Error: 'wsgi.url_scheme'

Screen Shot 2022-02-21 at 9 58 33 AM
mattdricker commented 2 years ago

URL: https://support.digication.com/hc/en-us/articles/235440448-Digication-Catalyst-For-Social-Learning

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/789050521/

Via behavior: Visiting any article on that site through Via triggers a CAPTCHA challenge.

Screen Shot 2022-02-22 at 1 34 27 PM
mkdir-washington-edu commented 2 years ago

URL: https://www.theoi.com/Text/OvidMetamorphoses1.html

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/809101584/

Via behavior: Gets to a cloudflare page that asks you to enable cookies

image
mkdir-washington-edu commented 2 years ago

URL: https://riojournal.com/articles.php?id=8812

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/810022493/

Via behavior: It looks like the page keeps trying to load an iframe over and over when loaded in Via:

image

(optional) Slack convo: https://hypothes-is.slack.com/archives/C2BLQDKHA/p1646926006015759

robertknight commented 2 years ago

Via behavior: It looks like the page keeps trying to load an iframe over and over when loaded in Via:

I don't see this when visiting https://via.hypothes.is/https://riojournal.com/articles.php?id=8812. However there is an issue that there is an embedded copy of Hypothesis on the page, and an iframe with enable-annotation set, and some combination of those prevents the client from working.

When trying to annotate the page in Via, I see this error:

Guest RPC connection error

It looks like the Hypothesis code in the iframe is failing to connect to the sidebar/host parts of Hypothesis injected by Via.

robertknight commented 2 years ago

https://github.com/hypothesis/product-backlog/issues/1270#issuecomment-1046968858 looks like a URL encoding issue. The first request below is the original URL, the second request is the URL that viahtml ends up requesting, the note percent-encoding of some characters in the query string.

$ curl -I -i 'https://quod.lib.umich.edu/f/frag/9772151.0006.001/--black-death-bodies?rgn=main;view=fulltext'
HTTP/1.1 200 OK
Date: Thu, 10 Mar 2022 16:14:05 GMT
Server: Apache/2.4.25 (Debian)
X-dlxs-legacy: HEAD /cgi/t/text/text-idx?cc=frag;c=frag;idno=9772151.0006.001;rgn=main;view=text HTTP/1.1
Set-Cookie: DLXSsid=f8769b840fb3863ec61c283f0bb49f60; domain=umich.edu; path=/
Vary: Accept-Encoding
Content-Type: text/html; charset=UTF-8
Strict-Transport-Security: max-age=31536000
Set-Cookie: STICKY=s146; path=/
Cache-control: private

$ curl -I -i 'https://quod.lib.umich.edu/f/frag/9772151.0006.001/--black-death-bodies?rgn=main%3Bview%3Dfulltext'
HTTP/1.1 301 Moved Permanently
Date: Thu, 10 Mar 2022 16:14:10 GMT
Server: Apache/2.4.25 (Debian)
X-dlxs-legacy: /f/frag/9772151.0006.001/--black-death-bodies?rgn=main%3Bview%3Dfulltext
Location: https://quod.lib.umich.edu/f/frag/9772151.0006.001/--black-death-bodies?rgn=main;view=fulltext
Strict-Transport-Security: max-age=31536000
Set-Cookie: STICKY=s147; path=/
Cache-control: private
mkdir-washington-edu commented 2 years ago

URL: https://time.com/5494404/tressie-mcmillan-cottom-thick-pregnancy-competent/

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/841784359/

Via behavior: The header of thew page appears but there's no page context.

mattdricker commented 2 years ago

URL: https://www.npr.org/2021/06/24/1009578809/cops-say-low-morale-and-department-scrutiny-are-driving-them-away-from-the-job (though this same effect can be seen with any page on NPR's site)

No support ticket.

Via behavior: Odd sizing difference in the annotate/highlight modal between via and direct site URL. Also subtle sizing/spacing diffs on the bucket bar.

Via:

Screen Shot 2022-03-31 at 4 23 18 PM

Not via:

Screen Shot 2022-03-31 at 4 23 27 PM
mattdricker commented 2 years ago

URL: https://www.vox.com (issue occurs on any page at vox.com)

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/847941229/

Via behavior: Page gets stuck in a loop of continually reloading.

mattdricker commented 2 years ago

URL: https://reason.com/2022/04/03/against-scientific-gatekeeping/

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/288798322/

Via behavior: Continual reload loop, as above vox.com issue.

Additional notes: Seems to be localized to Chrome (v100.0.4896.75). Can not reproduce in FF or Safari.

Each reload of the page adds an /[object%20Object] to the end of the URL

mkdir-washington-edu commented 2 years ago

URL: https://www.accreditedschoolsonline.org/resources/successful-online-student-characteristics/

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/892690993/

Via behavior: Sidebar loses the open/close button and the Adder doesn't appear when text is selected.

Additional notes: Same behavior with the Chrome extension

mkdir-washington-edu commented 2 years ago

URL: https://writingspaces.org/

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/911404893/

Via behavior: gives the following error:

Via Error https://writingspaces.org/?via.client.ignoreOtherConfiguration=1&via.client.openSidebar=1&via.external_link_mode=new-tab

Error Details:

{'args': {'coll': 'proxy', 'type': 'replay', 'index': '$live', 'templates': {'banner': 'templates/banner.rewriter', 'error': 'templates/error.rewriter', 'head_insert': 'templates/head_insert.rewriter', 'index': 'template/index.html', 'search': 'template/search.html', 'query': 'template/query.html'}}, 'error': '{"message": "https://writingspaces.org/", "errors": {"LiveWebLoader": "https://writingspaces.org/"}}'}

(optional) Slack convo: https://hypothes-is.slack.com/archives/C2BLQDKHA/p1652709309801049

added note: later on that day the site started working again for the user and for us. I'm wondering if the site was testing some new code and pulled it back.

mkdir-washington-edu commented 2 years ago

URL: https://www.evernote.com/shard/s44/sh/cae6dc0b-a201-48a7-98da-1cf160b0194d/77a44581fa7bc271981aa48eff17569a

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/903995914/

Via behavior: In Android 10 on Chrome 101 and Duck Duck Go 5.124 the above link loads the Hypothesis Sidebar but does not load the underlying page. I replicated this problem on my device but can not replicate on my Mac (i.e. on my Mac Via displays the URL and the Sidebar).

mattdricker commented 2 years ago

URL: https://lorishyba.pbworks.com/f/salen_ecology.pdf

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/956175998

Via behavior: User receives via error message:

UpstreamServiceError: HTTPSConnectionPool(host='lorishyba.pbworks.com', port=443): Max retries exceeded with url: /f/salen_ecology.pdf (Caused by SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1131)')))

URL: https://via.hypothes.is/https://lorishyba.pbworks.com/f/salen_ecology.pdf

Third party URL: https://lorishyba.pbworks.com/f/salen_ecology.pdf
Screen Shot 2022-06-17 at 9 48 07 AM
chrisdaaz commented 2 years ago

URL: https://www.smithsonianmag.com/science-nature/what-math-180975882/

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/1116443825

Via behavior: User receives via error message:

image

chrisdaaz commented 1 year ago

URL: https://ar5iv.labs.arxiv.org/html/2205.09940

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/1378515515

Via behavior:

UpstreamServiceError: HTTPSConnectionPool(host='ar5iv.labs.arxiv.org', port=443): Max retries exceeded with url: /html/2205.09940 (Caused by SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1131)')))

URL: https://via.hypothes.is/https://ar5iv.labs.arxiv.org/html/2205.09940

Third party URL: https://ar5iv.labs.arxiv.org/html/2205.09940

image

robertknight commented 1 year ago

I can reproduce in Python locally:

> import requests
> requests.get('https://ar5iv.labs.arxiv.org')
requests.exceptions.SSLError: HTTPSConnectionPool(host='ar5iv.labs.arxiv.org', port=443): Max retries exceeded with url: / (Caused by SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1131)')))

I'm guessing we need to check that we've got certifi set up and up to date in Via.

chrisdaaz commented 1 year ago

@robertknight should i create a new issue for the ar5iv SSL error? wasn't sure if that's the next step

robertknight commented 1 year ago

@chrisdaaz - Yes, please do. Can you file it in the Via repository?

chrisdaaz commented 1 year ago

URL: https://www.publishersweekly.com/pw/by-topic/industry-news/publisher-news/article/90798-court-blocks-penguin-random-house-s-s-merger.html

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/1395457118

Via behavior: The loading icon keeps spinning, never loading the annotations on the page. This is actually true for any Publisher's Weekly article

image

@robertknight given that the page loads, but the annotations never load, is this a separate GitHub issue that needs to be written? Happy to do so if so.

a bunch of JavaScript on their site won't finish loading, it seems:

image

Example: https://viahtml.hypothes.is/proxy/js_/https://chimpstatic.com/mcjs-connected/js/users/d684790bedf89afe76e7b9156/ac1bb0054950d604b6a28934a.js

chrisdaaz commented 1 year ago

URL: https://www.nngroup.com/articles/attention-economy/

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/1400341105

Via behavior:

Via changes the URL from /articles/attention-economy/ to /api/content/news/zones/articles-left/announcements/, which is what is saved in the database. An instructor who assigned articles from https://www.nngroup.com cannot see annotations made by students in the assignment view or in SpeedGrader; however can see (in SpeedGrader) that submissions are made.

Version: 1.1207.0
User Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/109.0.0.0 Safari/537.36 Edg/109.0.1518.70
URL: https://www.nngroup.com/api/content/news/zones/articles-left/announcements/
Fingerprint: N/A
Account: chrisdaaz (acct:chris.diaz@hypothes.is)
Date: Tue Jan 31 2023 21:40:12 GMT-0600 (Central Standard Time)

https://via.hypothes.is/https://www.nngroup.com/articles/attention-economy/

I am wondering if using the Update Annotation URLs tool would fix their problem, @robertknight ? As a word of caution, there are hundreds of annotations linked to https://www.nngroup.com/api/content/news/zones/articles-left/announcements/ we think, more than just this class.

{
  "https://www.nngroup.com/api/content/news/zones/articles-left/announcements/": {
    "url": "https://www.nngroup.com/articles/attention-economy/",
    "document": { "title": "Welcome to page A" }
  }
}

For whatever reason, the web app works fine on that particular article. I can see 10 annotations in the public group.

When we refresh the page in both via and in the LMS, we get the proper URL for the article:

Version: 1.1207.0
User Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/109.0.0.0 Safari/537.36 Edg/109.0.1518.70
URL: https://www.nngroup.com/articles/attention-economy/
Fingerprint: N/A
Account: Professor Dean (acct:6aa35b2213e455537e7460880bfc1a@lms.hypothes.is)
Date: Wed Feb 01 2023 09:09:12 GMT-0600 (Central Standard Time)
mkdir-washington-edu commented 1 year ago

URL: https://sway.office.com/7MCLvrJy7arFnVw8

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/1507114425

Via behavior: Occasionally the page loads too slowly and all annotations orphan.

mkdir-washington-edu commented 1 year ago

URL: https://firstmonday.org/ojs/index.php/fm/article/view/466/387

Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/1654080308

Via behavior: Text selection doesn't not trigger adder popup and does not trigger than change to the Page Note button.

janraev commented 1 year ago

URL: https://openstax.org/books/writing-guide/pages/3-6-editing-focus-sentence-structure

Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/1923373410 https://app.hubspot.com/contacts/6291320/record/0-5/1923484208

Via behavior: Page displays briefly on the assignment and then it goes blank.

mkdir-washington-edu commented 1 year ago

URL: https://muse.jhu.edu/pub/48/article/885505

Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/1785259189

Via behavior: Many page elements look broken. I'm unsure if this is a problem with our proxy or if the site isn't allowing these elements to be displayed in a proxy.

image

(optional) Slack convo: https://hypothes-is.slack.com/archives/C2BLQDKHA/p1695387982385939

robertknight commented 1 year ago

Regarding OpenStax (https://github.com/hypothesis/product-backlog/issues/1270#issuecomment-1730228919) I was able to reproduce locally using their open source reader app (https://github.com/openstax/rex-web), which is also hosted on Heroku (https://rex-web.herokuapp.com/).

Based on errors in the console, it looked like errors might relate to a script called osano.js (from https://cmp.osano.com/AzZqbXTbzhHsU3cv1/68d8e8ae-4024-4489-a000-72766ad284a6/osano.js). I tested blocking this script from loading, both in Chrome and Firefox, and the content was able to load successfully in that case.

I hadn't heard of Osano before, but it looks like a platform that provides GDPR banners (their sales pitch to customers on the front page is "No Fines. No Penalties", ie. use us and don't get sued). As long as the site continues to work without it, we could just block the entire domain.

robertknight commented 1 year ago

I have configured Via to block the domain cmp.osano.com, following the steps at https://stackoverflowteams.com/c/hypothesis/questions/102. This allows the OpenStax reader page at https://openstax.org/books/writing-guide/pages/3-6-editing-focus-sentence-structure to work in Via again. I did see an unrelated problem where existing highlights would disappear from the bucket bar after the page loaded.

janraev commented 1 year ago

URL: https://www.npr.org/transcripts/1098166279

Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/1966628974/

Via behavior(s):

Browser used to replicate issue: Google Chrome Version 117.0.5938.150 (Official Build) (64-bit) (Happening in both Incognito and non-Incognito)

OS: Win 10

image image

Attached file: HAR file when it did not load at all NPR.zip

mkdir-washington-edu commented 11 months ago

URL: https://en.wikipedia.org/wiki/Cognitive_bias

Support ticket: N/A

Via behavior: Images on the page are not loading.

image

(optional) Slack convo:

mkdir-washington-edu commented 10 months ago

URL: https://www.sapiens.org/language/primate-vocalizations/

Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/2316682380

Via behavior: The background of the overlaid page is clear in Via but not clear normally. This causes the images of the background page to make reading the text difficult.

Normal page:

image

Via:

image

(optional) Slack convo:

janraev commented 9 months ago

URL: https://www.bloomberg.com/news/features/2022-11-10/a-challenge-for-cities-going-green-without-the-gentrification

Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/2383615853

Via behavior: The main article does not load. A message 'failed to render article' is displayed instead.

Screenshot 2024-02-19 at 10 07 26 PM
janraev commented 9 months ago

URL: https://www.facultyfocus.com/articles/online-education/online-student-engagement/15-ways-to-engage-your-students-in-person-online-and-in-zoom/

Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/2433701537

Via behavior: The main article does not load. No error message.

Screenshot 2024-03-04 at 9 00 28 PM
mkdir-washington-edu commented 8 months ago

Adding to the comment directly above @janraev FYI Article content doesn't appear in Via, but it is loaded in the page elements.

regular page:

image

page in Via:

image
robertknight commented 8 months ago

URL: https://www.facultyfocus.com/articles/online-education/online-student-engagement/15-ways-to-engage-your-students-in-person-online-and-in-zoom/

The website's content is present in the page but hidden by some inline styles on an element. I think what is supposed to happen is that after certain content loads (a font?), that content is unhidden. However in Via, the JavaScript that does the un-hiding crashes and so the content is never revealed.

The JS error is:

Uncaught TypeError: 'get' on proxy: property 'removeEventListener' is a read-only and non-configurable data property on the proxy target but the proxy did not return its actual value (expected 'function(t){var r="removeEventListener"+t;if(typeof e[r]!=="undefined"){e[r].apply(e,arguments)}else{n.apply(e,arguments)}}' but got 'function () { [native code] }')
    at t (TYVZHePxisufUuSiVWDElscksnaOxEbE:15:39229)

On this JS:

loadQueuedResources: async function() {
    window.dispatchEvent(new Event("NitroBootStart"));
    if (d) {
        clearTimeout(d);
        d = null
    }
    window.removeEventListener("load", NPRL.loadQueuedResources);
    a = r == 1 ? J : V;
    if (!n || s) {
        _(a)
    } else {
...
janraev commented 8 months ago

URL: https://hcommons.org/app/uploads/sites/1003683/2024/02/DKTZLTV.pdf

Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/2499966489

Via behavior: The PDF loads but the H sidebar doesn't. It appears to be loading something as indicated by the red bar under the address bar, but it does not progress any further.

Screenshot 2024-03-19 at 2 25 01 AM

The same URL works when used in an LMS assignment. See https://hypothesis.instructure.com/courses/585/assignments/6492. The title in the first page of the PDF though have changed to what seems to be the corresponding text in the text layer. See annotation in the assignment for reference.

robertknight commented 8 months ago

The server behind this URL is doing some filtering based on the User-Agent header, and possibly others. I can't remember what User Agent Via uses, but this might mean that the content returned is different when the server fetches the URL (returns HTML 403) and when the browser fetches it (returns PDF). As a result, Via "detects" the content as HTML but it loads in the browser as a PDF.

curl -I 'https://hcommons.org/app/uploads/sites/1003683/2024/02/DKTZLTV.pdf'
HTTP/2 403
server: awselb/2.0
date: Tue, 19 Mar 2024 06:46:20 GMT
content-type: text/html
content-length: 118

curl -I 'https://hcommons.org/app/uploads/sites/1003683/2024/02/DKTZLTV.pdf' -A 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/17.2.1 Safari/605.1.15'
HTTP/2 200
date: Tue, 19 Mar 2024 06:45:53 GMT
content-type: application/pdf
content-length: 396430
janraev commented 7 months ago

URL: https://time.com/6553085/jo-koy-golden-globes-barbie-joke/

Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/2591646987

Via behavior: The article displays for a brief moment and the page goes entirely blank