hypothesis / product-backlog

Where new feature ideas and current bugs for the Hypothesis product live
118 stars 7 forks source link

URLs that do not work correctly in Via or the LMS app #1270

Open mkdir-washington-edu opened 2 years ago

mkdir-washington-edu commented 2 years ago

Bug report form

See: https://hypothes-is.slack.com/archives/C2BLQDKHA/p1642175183152600?thread_ts=1641839979.048300&cid=C2BLQDKHA

tl;dr We see Via users (especially instructors using the LMS app) not always anticipating the issues they may get on URLs they want to annotate. Since this is largely about LMS users the chrome extension and bookmarklet are not viable options.

This issue will be to track "problem URLs"; the reasons might be log ins, our proxy being blocked, content within iframes, and other reasons.

Support folks adding URLs to this form, please list out:

Support can help keep this list useful by, of course, vetting these sites and make sure the user is reasonable in what they're linking to. Reasonable would be a site where a user can see the text they would expect students to annotate.

Template for comments: URL:

Support ticket:

Via behavior:

(optional) Slack convo:

janraev commented 6 months ago

URL: https://nces.ed.gov/surveys/pirls/pirls2021/index.asp

Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/2795447113

Via behavior: The body of text in the webpage does not load.

image

janraev commented 5 months ago

URL: https://www.colorado.edu/health/blog/cannabis

Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/2913397121

Via behavior: It takes at least a full minute to see body of texts in the webpage that can be annotated.

There are several console errors. See screenshot below:

Screenshot 2024-07-04 at 4 40 43 AM
robertknight commented 5 months ago

URL: https://www.colorado.edu/health/blog/cannabis

This page inserts a large number of <script> elements in the <head> without async or defer attributes. As a result they block loading of the body. A certain number of these scripts are loaded successfully, but then Via hangs on one of them and it blocks until the request eventually hits a timeout after a minute.

janraev commented 4 months ago

URL: https://www.joanganzcooneycenter.org/wp-content/uploads/2014/10/jgcc_leveluplearning_final.pdf

Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/2955709528

Via behavior: The PDF loads but the H sidebar doesn't. It appears to be loading something as indicated by the red bar under the address bar, but it does not progress any further.

image

Possibly the same as https://github.com/hypothesis/product-backlog/issues/1270#issuecomment-2004670707?

robertknight commented 4 months ago

Possibly the same as https://github.com/hypothesis/product-backlog/issues/1270#issuecomment-2004670707?

Yes. In both cases it looks like the PDF is being loaded in the native PDF viewer, instead of our custom annotation-enabled one. This can happen if the "router" in Via detects the content type as HTML and tries to load it as such, but then the content actually turns out to be a PDF.

I notice that fetching this file with cURL returns a 403:

$ curl -i 'https://www.joanganzcooneycenter.org/wp-content/uploads/2014/10/jgcc_leveluplearning_final.pdf'
HTTP/2 403
server: awselb/2.0
date: Fri, 12 Jul 2024 13:38:26 GMT
content-type: text/html
content-length: 118

Spoofing a browser by setting the User-Agent header results in the real PDF being returned:

$ curl -i 'https://www.joanganzcooneycenter.org/wp-content/uploads/2014/10/jgcc_leveluplearning_final.pdf' -H 'User-Agent: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/17.4.1 Safari/605.1.15'
HTTP/2 200
date: Fri, 12 Jul 2024 13:39:08 GMT
content-type: application/pdf
server: Apache/2.4.41 (Ubuntu)

So we should check what user agent we are using for the initial request for the content in Via that discovers what kind of content it is.

janraev commented 4 months ago

URL: https://storymaps.arcgis.com/stories/62f54d97f76f4e3cb50775d309be6cce

Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/3011890245

Via behavior: Webpage displayed is just blank. Console shows 504 error.

image

From a partner school, this is what displayed for them in the LMS:

image

janraev commented 3 months ago

URL: https://manifold.open.umn.edu/read/complete-text/section/66ad2737-0444-4c0c-beda-0bbeee592c38

Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/3067397767

Via Behavior: Page opens up along with the sidebar. When attempting to annotate, the built-in annotator from Manifold is the only one showing up.

image

mkdir-washington-edu commented 3 months ago

For https://github.com/hypothesis/product-backlog/issues/1270#issuecomment-2269945443

While barely knowing what I'm talking about, I'd bet that their annotator is swallowing the text selection event so we don't even see that it's happening.

janraev commented 3 months ago

URL: https://www.historians.org/resource/why-study-history/

Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/3158963352

Via Behavior: It takes a minute before the H sidebar shows up. Similar to https://github.com/hypothesis/product-backlog/issues/1270#issuecomment-2207232861

janraev commented 3 months ago

URL: https://www.abdallahcandies.com/information/7-health-benefits-of-eating-chocolate/

Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/3159861007

Via Behavior: It takes a minute before the H sidebar shows up. Similar to https://github.com/hypothesis/product-backlog/issues/1270#issuecomment-2207232861

janraev commented 3 months ago

URL:

Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/3163364468

Via Behavior: It takes a minute before the H sidebar shows up. Similar to https://github.com/hypothesis/product-backlog/issues/1270#issuecomment-2207232861

janraev commented 3 months ago

URL: https://storymaps.arcgis.com/stories/abba237bedbc4e2a98e65654cbfc7e65/ Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/3163599147

Via Behavior: Page does not load and console shows 504 error.

image

janraev commented 2 months ago

URL: https://www.nais.org/magazine/independent-school/winter-2008/brainology/ Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/3176373945

Via Behavior: It takes a minute before the H sidebar shows up. Similar to https://github.com/hypothesis/product-backlog/issues/1270#issuecomment-2207232861

janraev commented 2 months ago

URL: https://www.nu.edu/blog/what-is-edtech-education-technology/ Support https://app.hubspot.com/contacts/6291320/record/0-5/3186385249

Via Behavior: It takes a minute before the H sidebar shows up. Similar to https://github.com/hypothesis/product-backlog/issues/1270#issuecomment-2207232861

janraev commented 2 months ago

URL: https://www.defenseone.com/threats/2015/06/its-nearly-impossible-understand-what-motivates-terrorists/114856/ Support https://app.hubspot.com/contacts/6291320/record/0-5/3187469362

Via Behavior: Page eventually loads but there are a lot of images that are missing. Console shows a lot of 504 errors

image

janraev commented 2 months ago

URL: https://www.chronicle.com/article/the-boys-are-doing-just-fine Support https://app.hubspot.com/contacts/6291320/record/0-5/3261288344

Via Behavior: Prompted that the browser is out of date

image

janraev commented 2 months ago

URL: https://www.sierraclub.org/sierra/future-oceans-environmental-justice-climate-change Support https://app.hubspot.com/contacts/6291320/record/0-5/3261288344

Via Behavior: Prompted with additional check via Captcha

image

janraev commented 2 months ago

URL: https://pressbooks.bccampus.ca/technicalwriting/chapter/communicatingprecision/ Support https://app.hubspot.com/contacts/6291320/record/0-5/3259309321

Via Behavior: Loads after a minute with some components loaded incompletely.

image

janraev commented 2 months ago

URL: http://www.newpaltz.edu//online/online-degrees/bgs/ Support https://app.hubspot.com/contacts/6291320/record/0-5/3260539314 Via Behavior: Loads after a minute in the LMS app. Assets do not display correctly on certain occasions.

Reporting instructor experienced this issue:

image

janraev commented 2 months ago

URL: https://www.pewresearch.org/religion/2023/06/02/online-religious-services-appeal-to-many-americans-but-going-in-person-remains-more-popular/ Support https://app.hubspot.com/contacts/6291320/record/0-5/3314561584 Via Behavior: Loads after more than a minute in the LMS app.

janraev commented 2 months ago

URL: http://www.americanyawp.com/text/introduction/ Support https://app.hubspot.com/contacts/6291320/record/0-5/3389621119 Via Behavior: Loads after more than a minute in the LMS app.

janraev commented 2 months ago

URL: https://pressbooks.bccampus.ca/onlinepresence/chapter/onlineprivacy/ Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/3388073554 Via Behavior: Loads after more than a minute in the LMS app.

janraev commented 2 months ago

URL: http://www.americanyawp.com/text/introduction/ Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/3389621119 Via Behavior: Loads after more than a minute in the LMS app.

janraev commented 2 months ago

URL: https://uark.pressbooks.pub/speaking/chapter/organizing-a-speech/ Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/3307148845 Via Behavior: Intermittent load behavior in the LMS app. Loading time varies from less than 10 secs to more than a minute.

janraev commented 2 months ago

URL: https://www.pewresearch.org/religion/2023/06/02/online-religious-services-appeal-to-many-americans-but-going-in-person-remains-more-popular/ Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/3314561584 Via Behavior: Loads after a minute. Intermittently, some assets are not loaded correctly.

janraev commented 2 months ago

URL: https://www.defenseone.com/threats/2015/06/its-nearly-impossible-understand-what-motivates-terrorists/114856/ Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/3450390898 Via Behavior: Takes a long time to load.

mkdir-washington-edu commented 1 month ago

URL: https://www.philosophersmag.com/essays/52-an-antidote-to-injustice - this URL automatically forwards to https://philosophersmag.com/an-antidote-to-injustice/ Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/3887260706 Via behavior: in the LMS context you sometimes see an error, sometimes you see the LMS app group(s) (which is what we'd want to happen), and sometimes the web app launches. Slack thread: https://hypothes-is.slack.com/archives/C2BLQDKHA/p1728319953382659 Assignment reproduction: https://hypothesis.instructure.com/courses/609/assignments/7779.

Image Image