Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

URLs that do not work correctly in Via or the LMS app #1270

Open
mkdir-washington-edu opened this issue Jan 14, 2022 · 50 comments
Open

URLs that do not work correctly in Via or the LMS app #1270

mkdir-washington-edu opened this issue Jan 14, 2022 · 50 comments

Comments

@mkdir-washington-edu
Copy link

mkdir-washington-edu commented Jan 14, 2022

Bug report form

See: https://hypothes-is.slack.com/archives/C2BLQDKHA/p1642175183152600?thread_ts=1641839979.048300&cid=C2BLQDKHA

tl;dr We see Via users (especially instructors using the LMS app) not always anticipating the issues they may get on URLs they want to annotate. Since this is largely about LMS users the chrome extension and bookmarklet are not viable options.

This issue will be to track "problem URLs"; the reasons might be log ins, our proxy being blocked, content within iframes, and other reasons.

Support folks adding URLs to this form, please list out:

  • the URLs,
  • the link to a support ticket where available,
  • the observed behavior of the site in Via,
  • if available, a link to a Slack conversation where a developer diagnosed the problem.

Support can help keep this list useful by, of course, vetting these sites and make sure the user is reasonable in what they're linking to. Reasonable would be a site where a user can see the text they would expect students to annotate.

Template for comments:
URL:

Support ticket:

Via behavior:

(optional) Slack convo:

@mkdir-washington-edu
Copy link
Author

URL: https://www.thelancet.com/journals/lancet/article/PIIS0140-6736(21)02870-1/fulltext

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/732776255/

Via behavior: Reported by the user:

Many students and he himself are receiving the error “viahtml3.hypothesis.is redirected you too many times.” Sometimes, the page the instructor is trying to use hypothesis on won't load, even though the hypothesis add on will. Finally, the instructor occasionally gets the following Error: "Via Error None Error Details:Internal Error: 'wsgi.url_scheme'"

@mkdir-washington-edu
Copy link
Author

mkdir-washington-edu commented Jan 14, 2022

URL: https://design-justice.pubpub.org/pub/ap8rgw5e/release/1

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/733849931/

Via behavior: After creating an annotation the page gives the error:

An error has occured in the editor.
We've logged the error and will look into the cause right away.
All previous changes have been successfully saved.
To continue editing, please refresh the page.

Related: https://github.com/hypothesis/support/issues/210

@mkdir-washington-edu mkdir-washington-edu changed the title URLs that do not work correctly in Via for investigation URLs that do not work correctly in Via or the LMS app Jan 14, 2022
@mkdir-washington-edu
Copy link
Author

URL: https://www.independent.co.uk/life-style/aren-t-you-happy-me-1621301.html

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/667004825/

Via behavior: The site automatically forwards to https://www.independent.co.uk/life-style/aren-t-you-happy**-for**-me-1621301.html, but annotations created before the forwarding and new URL were set up are associated with the first URL, so they are no longer accessible in the context of the LMS app.

@mkdir-washington-edu
Copy link
Author

@mkdir-washington-edu
Copy link
Author

URL: not specific, not New York Times websites

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/728305781/

Via behavior: User reports:

New York Times paywall did not used to cause problems, but even when I am logged into my NYT account, I now cannot get access to any articles via Hypothesis in Canvas. When I am invited to login, it seems that I can enter my nyt credentials, but then the captcha fails.

@mkdir-washington-edu
Copy link
Author

URL: http://rhetoric.byu.edu/

Support ticket: none

Via behavior: Adder doesn't appear when selecting text

Slack convo: https://hypothes-is.slack.com/archives/C2BLQDKHA/p1641947154097300

@mattdricker
Copy link

URL: https://via.hypothes.is/https://cvc.cervantes.es/

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/477078696/

Via behavior:

UpstreamServiceError: (‘Connection aborted.’, ConnectionResetError(104, ‘Connection reset by peer’))
URL: https://via.hypothes.is/https://cvc.cervantes.es
Third party URL: None

125513356-a3db8224-e246-407d-92bd-266ed470d39d

Related: https://github.com/hypothesis/support/issues/215#issue-943763801

@mattdricker
Copy link

URL: https://www.dhm.de/lemo/kapitel/weimarer-republik/alltagsleben.html

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/757452373/

Via behavior:

UpstreamServiceError: HTTPSConnectionPool(host='www.dhm.de', port=443): Max retries exceeded with url: /lemo/kapitel/weimarer-republik/alltagsleben.html (Caused by SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1131)')))

URL: https://via.hypothes.is/https://www.dhm.de/lemo/kapitel/weimarer-republik/alltagsleben.html

Third party URL: https://www.dhm.de/lemo/kapitel/weimarer-republik/alltagsleben.html

Screen Shot 2022-01-28 at 4 11 35 PM

@mkdir-washington-edu
Copy link
Author

mkdir-washington-edu commented Feb 14, 2022

Ignore this. See Rob's comment below.

URL: https://musicbrainz.org/

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/778913126/

Via behavior: Page font gets larger. Happens with Bookmarklet as well.

@robertknight
Copy link
Member

robertknight commented Feb 15, 2022

Via behavior: Page font gets larger. Happens with Bookmarklet as well.

Can you pull this one out into a separate issue. It looks like this relates to some PDF.js-specific changes that happen to be loaded in normal web pages as well. Two things we need to do here:

  1. Not load the pdfjs-overrides.css file outside of PDF.js (see Only load PDF.js style overrides in PDF.js client#4224)
  2. Continue looking for solutions to the adder REM-sizing issue that don't affect the host page.

CC @lyzadanger (this can wait until you are back)

robertknight added a commit to hypothesis/client that referenced this issue Feb 15, 2022
Only load the `pdfjs-overrides.css` bundle when the client is loaded in
PDF.js.

This fixes an issue where a global styling change that was recently
added specifically for PDF.js [1] ended up affecting other web pages
[2].

[1] #4208
[2] hypothesis/product-backlog#1270 (comment)
robertknight added a commit to hypothesis/client that referenced this issue Feb 15, 2022
Only load the `pdfjs-overrides.css` bundle when the client is loaded in
PDF.js.

This fixes an issue where a global styling change that was recently
added specifically for PDF.js [1] ended up affecting other web pages
[2].

[1] #4208
[2] hypothesis/product-backlog#1270 (comment)
@mkdir-washington-edu
Copy link
Author

@robertknight Thank you! #1317

@mkdir-washington-edu
Copy link
Author

URL: https://opentext.uleth.ca/apex-video/sec_disk.html

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/781988657

Via behavior: embedded javascript in the page doesn't work in Via. From the user:

The problem seems to be that the app is interfering with the Asymptote renderer itself, as the same problem appears if just the asymptote html image document (e.g. https://www2.hawaii.edu/~gradysw/apex/images/img_cross1.html ) is placed inside the LMS app.

@mattdricker
Copy link

URL: https://quod.lib.umich.edu/f/frag/9772151.0006.001/--black-death-bodies?rgn=main;view=fulltext

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/787255475/

Via behavior: Variable when creating LMS assignment. Most often the page will stop loading with a Chrome error viahtml3.hypothes.is has redirected you too many times

Screen Shot 2022-02-21 at 9 33 19 AM

Have also witnessed the page loading fully but with the public Hypothesis web app loaded, rather than LMS app.

User reports seeing the error Via Error None Error Details: Internal Error: 'wsgi.url_scheme'

Screen Shot 2022-02-21 at 9 58 33 AM

@mattdricker
Copy link

URL: https://support.digication.com/hc/en-us/articles/235440448-Digication-Catalyst-For-Social-Learning

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/789050521/

Via behavior: Visiting any article on that site through Via triggers a CAPTCHA challenge.
Screen Shot 2022-02-22 at 1 34 27 PM

@mkdir-washington-edu
Copy link
Author

URL: https://www.theoi.com/Text/OvidMetamorphoses1.html

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/809101584/

Via behavior: Gets to a cloudflare page that asks you to enable cookies
image

@mkdir-washington-edu
Copy link
Author

URL: https://riojournal.com/articles.php?id=8812

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/810022493/

Via behavior: It looks like the page keeps trying to load an iframe over and over when loaded in Via:
image

(optional) Slack convo: https://hypothes-is.slack.com/archives/C2BLQDKHA/p1646926006015759

@robertknight
Copy link
Member

robertknight commented Mar 10, 2022

Via behavior: It looks like the page keeps trying to load an iframe over and over when loaded in Via:

I don't see this when visiting https://via.hypothes.is/https://riojournal.com/articles.php?id=8812. However there is an issue that there is an embedded copy of Hypothesis on the page, and an iframe with enable-annotation set, and some combination of those prevents the client from working.

When trying to annotate the page in Via, I see this error:

Guest RPC connection error

It looks like the Hypothesis code in the iframe is failing to connect to the sidebar/host parts of Hypothesis injected by Via.

@robertknight
Copy link
Member

robertknight commented Mar 10, 2022

#1270 (comment) looks like a URL encoding issue. The first request below is the original URL, the second request is the URL that viahtml ends up requesting, the note percent-encoding of some characters in the query string.

$ curl -I -i 'https://quod.lib.umich.edu/f/frag/9772151.0006.001/--black-death-bodies?rgn=main;view=fulltext'
HTTP/1.1 200 OK
Date: Thu, 10 Mar 2022 16:14:05 GMT
Server: Apache/2.4.25 (Debian)
X-dlxs-legacy: HEAD /cgi/t/text/text-idx?cc=frag;c=frag;idno=9772151.0006.001;rgn=main;view=text HTTP/1.1
Set-Cookie: DLXSsid=f8769b840fb3863ec61c283f0bb49f60; domain=umich.edu; path=/
Vary: Accept-Encoding
Content-Type: text/html; charset=UTF-8
Strict-Transport-Security: max-age=31536000
Set-Cookie: STICKY=s146; path=/
Cache-control: private

$ curl -I -i 'https://quod.lib.umich.edu/f/frag/9772151.0006.001/--black-death-bodies?rgn=main%3Bview%3Dfulltext'
HTTP/1.1 301 Moved Permanently
Date: Thu, 10 Mar 2022 16:14:10 GMT
Server: Apache/2.4.25 (Debian)
X-dlxs-legacy: /f/frag/9772151.0006.001/--black-death-bodies?rgn=main%3Bview%3Dfulltext
Location: https://quod.lib.umich.edu/f/frag/9772151.0006.001/--black-death-bodies?rgn=main;view=fulltext
Strict-Transport-Security: max-age=31536000
Set-Cookie: STICKY=s147; path=/
Cache-control: private

@mkdir-washington-edu
Copy link
Author

URL: https://time.com/5494404/tressie-mcmillan-cottom-thick-pregnancy-competent/

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/841784359/

Via behavior: The header of thew page appears but there's no page context.

@mattdricker
Copy link

URL: https://www.npr.org/2021/06/24/1009578809/cops-say-low-morale-and-department-scrutiny-are-driving-them-away-from-the-job (though this same effect can be seen with any page on NPR's site)

No support ticket.

Via behavior: Odd sizing difference in the annotate/highlight modal between via and direct site URL. Also subtle sizing/spacing diffs on the bucket bar.

Via:
Screen Shot 2022-03-31 at 4 23 18 PM

Not via:
Screen Shot 2022-03-31 at 4 23 27 PM

@mattdricker
Copy link

URL: https://www.vox.com (issue occurs on any page at vox.com)

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/847941229/

Via behavior: Page gets stuck in a loop of continually reloading.

@mattdricker
Copy link

URL: https://reason.com/2022/04/03/against-scientific-gatekeeping/

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/288798322/

Via behavior: Continual reload loop, as above vox.com issue.

Additional notes: Seems to be localized to Chrome (v100.0.4896.75). Can not reproduce in FF or Safari.

Each reload of the page adds an /[object%20Object] to the end of the URL

@mkdir-washington-edu
Copy link
Author

URL: https://www.accreditedschoolsonline.org/resources/successful-online-student-characteristics/

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/892690993/

Via behavior: Sidebar loses the open/close button and the Adder doesn't appear when text is selected.

Additional notes: Same behavior with the Chrome extension

@mkdir-washington-edu
Copy link
Author

mkdir-washington-edu commented May 16, 2022

URL: https://writingspaces.org/

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/911404893/

Via behavior: gives the following error:

Via Error
https://writingspaces.org/?via.client.ignoreOtherConfiguration=1&via.client.openSidebar=1&via.external_link_mode=new-tab

Error Details:

{'args': {'coll': 'proxy', 'type': 'replay', 'index': '$live', 'templates': {'banner': 'templates/banner.rewriter', 'error': 'templates/error.rewriter', 'head_insert': 'templates/head_insert.rewriter', 'index': 'template/index.html', 'search': 'template/search.html', 'query': 'template/query.html'}}, 'error': '{"message": "https://writingspaces.org/", "errors": {"LiveWebLoader": "https://writingspaces.org/"}}'}

(optional) Slack convo: https://hypothes-is.slack.com/archives/C2BLQDKHA/p1652709309801049

added note: later on that day the site started working again for the user and for us. I'm wondering if the site was testing some new code and pulled it back.

@mkdir-washington-edu
Copy link
Author

URL: https://www.evernote.com/shard/s44/sh/cae6dc0b-a201-48a7-98da-1cf160b0194d/77a44581fa7bc271981aa48eff17569a

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/903995914/

Via behavior: In Android 10 on Chrome 101 and Duck Duck Go 5.124 the above link loads the Hypothesis Sidebar but does not load the underlying page. I replicated this problem on my device but can not replicate on my Mac (i.e. on my Mac Via displays the URL and the Sidebar).

@mattdricker
Copy link

URL: https://lorishyba.pbworks.com/f/salen_ecology.pdf

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/956175998

Via behavior: User receives via error message:

UpstreamServiceError: HTTPSConnectionPool(host='lorishyba.pbworks.com', port=443): Max retries exceeded with url: /f/salen_ecology.pdf (Caused by SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1131)')))

URL: https://via.hypothes.is/https://lorishyba.pbworks.com/f/salen_ecology.pdf

Third party URL: https://lorishyba.pbworks.com/f/salen_ecology.pdf

Screen Shot 2022-06-17 at 9 48 07 AM

@chrisdaaz
Copy link

URL: https://www.smithsonianmag.com/science-nature/what-math-180975882/

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/1116443825

Via behavior: User receives via error message:

image

@chrisdaaz
Copy link

URL: https://ar5iv.labs.arxiv.org/html/2205.09940

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/1378515515

Via behavior:

UpstreamServiceError: HTTPSConnectionPool(host='ar5iv.labs.arxiv.org', port=443): Max retries exceeded with url: /html/2205.09940 (Caused by SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1131)')))

URL: https://via.hypothes.is/https://ar5iv.labs.arxiv.org/html/2205.09940

Third party URL: https://ar5iv.labs.arxiv.org/html/2205.09940

image

@robertknight
Copy link
Member

robertknight commented Jan 18, 2023

I can reproduce in Python locally:

> import requests
> requests.get('https://ar5iv.labs.arxiv.org')
requests.exceptions.SSLError: HTTPSConnectionPool(host='ar5iv.labs.arxiv.org', port=443): Max retries exceeded with url: / (Caused by SSLError(SSLCertVerificationError(1, '[SSL: CERTIFICATE_VERIFY_FAILED] certificate verify failed: unable to get local issuer certificate (_ssl.c:1131)')))

I'm guessing we need to check that we've got certifi set up and up to date in Via.

@chrisdaaz
Copy link

@robertknight should i create a new issue for the ar5iv SSL error? wasn't sure if that's the next step

@robertknight
Copy link
Member

@chrisdaaz - Yes, please do. Can you file it in the Via repository?

@chrisdaaz
Copy link

URL: https://www.publishersweekly.com/pw/by-topic/industry-news/publisher-news/article/90798-court-blocks-penguin-random-house-s-s-merger.html

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/1395457118

Via behavior: The loading icon keeps spinning, never loading the annotations on the page. This is actually true for any Publisher's Weekly article

image

@robertknight given that the page loads, but the annotations never load, is this a separate GitHub issue that needs to be written? Happy to do so if so.

a bunch of JavaScript on their site won't finish loading, it seems:

image

Example: https://viahtml.hypothes.is/proxy/js_/https://chimpstatic.com/mcjs-connected/js/users/d684790bedf89afe76e7b9156/ac1bb0054950d604b6a28934a.js

@chrisdaaz
Copy link

chrisdaaz commented Feb 1, 2023

URL: https://www.nngroup.com/articles/attention-economy/

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/1400341105

Via behavior:

Via changes the URL from /articles/attention-economy/ to /api/content/news/zones/articles-left/announcements/, which is what is saved in the database. An instructor who assigned articles from https://www.nngroup.com cannot see annotations made by students in the assignment view or in SpeedGrader; however can see (in SpeedGrader) that submissions are made.

Version: 1.1207.0
User Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/109.0.0.0 Safari/537.36 Edg/109.0.1518.70
URL: https://www.nngroup.com/api/content/news/zones/articles-left/announcements/
Fingerprint: N/A
Account: chrisdaaz (acct:chris.diaz@hypothes.is)
Date: Tue Jan 31 2023 21:40:12 GMT-0600 (Central Standard Time)

https://via.hypothes.is/https://www.nngroup.com/articles/attention-economy/

I am wondering if using the Update Annotation URLs tool would fix their problem, @robertknight ? As a word of caution, there are hundreds of annotations linked to https://www.nngroup.com/api/content/news/zones/articles-left/announcements/ we think, more than just this class.

{
  "https://www.nngroup.com/api/content/news/zones/articles-left/announcements/": {
    "url": "https://www.nngroup.com/articles/attention-economy/",
    "document": { "title": "Welcome to page A" }
  }
}

For whatever reason, the web app works fine on that particular article. I can see 10 annotations in the public group.

When we refresh the page in both via and in the LMS, we get the proper URL for the article:

Version: 1.1207.0
User Agent: Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/109.0.0.0 Safari/537.36 Edg/109.0.1518.70
URL: https://www.nngroup.com/articles/attention-economy/
Fingerprint: N/A
Account: Professor Dean (acct:6aa35b2213e455537e7460880bfc1a@lms.hypothes.is)
Date: Wed Feb 01 2023 09:09:12 GMT-0600 (Central Standard Time)

@mkdir-washington-edu
Copy link
Author

URL: https://sway.office.com/7MCLvrJy7arFnVw8

Support ticket: https://app.hubspot.com/contacts/6291320/ticket/1507114425

Via behavior: Occasionally the page loads too slowly and all annotations orphan.

@mkdir-washington-edu
Copy link
Author

URL: https://firstmonday.org/ojs/index.php/fm/article/view/466/387

Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/1654080308

Via behavior: Text selection doesn't not trigger adder popup and does not trigger than change to the Page Note button.

@janraev
Copy link

janraev commented Sep 21, 2023

@mkdir-washington-edu
Copy link
Author

URL: https://muse.jhu.edu/pub/48/article/885505

Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/1785259189

Via behavior: Many page elements look broken. I'm unsure if this is a problem with our proxy or if the site isn't allowing these elements to be displayed in a proxy.
image

(optional) Slack convo: https://hypothes-is.slack.com/archives/C2BLQDKHA/p1695387982385939

@robertknight
Copy link
Member

Regarding OpenStax (#1270 (comment)) I was able to reproduce locally using their open source reader app (https://github.com/openstax/rex-web), which is also hosted on Heroku (https://rex-web.herokuapp.com/).

Based on errors in the console, it looked like errors might relate to a script called osano.js (from https://cmp.osano.com/AzZqbXTbzhHsU3cv1/68d8e8ae-4024-4489-a000-72766ad284a6/osano.js). I tested blocking this script from loading, both in Chrome and Firefox, and the content was able to load successfully in that case.

I hadn't heard of Osano before, but it looks like a platform that provides GDPR banners (their sales pitch to customers on the front page is "No Fines. No Penalties", ie. use us and don't get sued). As long as the site continues to work without it, we could just block the entire domain.

@robertknight
Copy link
Member

I have configured Via to block the domain cmp.osano.com, following the steps at https://stackoverflowteams.com/c/hypothesis/questions/102. This allows the OpenStax reader page at https://openstax.org/books/writing-guide/pages/3-6-editing-focus-sentence-structure to work in Via again. I did see an unrelated problem where existing highlights would disappear from the bucket bar after the page loaded.

@janraev
Copy link

janraev commented Oct 12, 2023

URL: https://www.npr.org/transcripts/1098166279

Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/1966628974/

Via behavior(s):

  • Page loads fine visually. When you start annotating, it becomes unresponsive.
  • Page does not load at all .

Browser used to replicate issue: Google Chrome Version 117.0.5938.150 (Official Build) (64-bit) (Happening in both Incognito and non-Incognito)

OS: Win 10

image
image

Attached file: HAR file when it did not load at all
NPR.zip

@mkdir-washington-edu
Copy link
Author

URL: https://en.wikipedia.org/wiki/Cognitive_bias

Support ticket: N/A

Via behavior: Images on the page are not loading.
image

(optional) Slack convo:

@mkdir-washington-edu
Copy link
Author

URL: https://www.sapiens.org/language/primate-vocalizations/

Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/2316682380

Via behavior: The background of the overlaid page is clear in Via but not clear normally. This causes the images of the background page to make reading the text difficult.

Normal page:
image

Via:
image

(optional) Slack convo:

@janraev
Copy link

janraev commented Feb 19, 2024

URL: https://www.bloomberg.com/news/features/2022-11-10/a-challenge-for-cities-going-green-without-the-gentrification

Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/2383615853

Via behavior: The main article does not load. A message 'failed to render article' is displayed instead.

Screenshot 2024-02-19 at 10 07 26 PM

@janraev
Copy link

janraev commented Mar 4, 2024

@mkdir-washington-edu
Copy link
Author

Adding to the comment directly above
@janraev FYI
Article content doesn't appear in Via, but it is loaded in the page elements.

regular page:
image

page in Via:
image

@robertknight
Copy link
Member

robertknight commented Mar 7, 2024

URL: https://www.facultyfocus.com/articles/online-education/online-student-engagement/15-ways-to-engage-your-students-in-person-online-and-in-zoom/

The website's content is present in the page but hidden by some inline styles on an element. I think what is supposed to happen is that after certain content loads (a font?), that content is unhidden. However in Via, the JavaScript that does the un-hiding crashes and so the content is never revealed.

The JS error is:

Uncaught TypeError: 'get' on proxy: property 'removeEventListener' is a read-only and non-configurable data property on the proxy target but the proxy did not return its actual value (expected 'function(t){var r="removeEventListener"+t;if(typeof e[r]!=="undefined"){e[r].apply(e,arguments)}else{n.apply(e,arguments)}}' but got 'function () { [native code] }')
    at t (TYVZHePxisufUuSiVWDElscksnaOxEbE:15:39229)

On this JS:

loadQueuedResources: async function() {
    window.dispatchEvent(new Event("NitroBootStart"));
    if (d) {
        clearTimeout(d);
        d = null
    }
    window.removeEventListener("load", NPRL.loadQueuedResources);
    a = r == 1 ? J : V;
    if (!n || s) {
        _(a)
    } else {
...

@janraev
Copy link

janraev commented Mar 18, 2024

URL: https://hcommons.org/app/uploads/sites/1003683/2024/02/DKTZLTV.pdf

Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/2499966489

Via behavior: The PDF loads but the H sidebar doesn't. It appears to be loading something as indicated by the red bar under the address bar, but it does not progress any further.

Screenshot 2024-03-19 at 2 25 01 AM

The same URL works when used in an LMS assignment. See https://hypothesis.instructure.com/courses/585/assignments/6492. The title in the first page of the PDF though have changed to what seems to be the corresponding text in the text layer. See annotation in the assignment for reference.

@robertknight
Copy link
Member

robertknight commented Mar 19, 2024

The server behind this URL is doing some filtering based on the User-Agent header, and possibly others. I can't remember what User Agent Via uses, but this might mean that the content returned is different when the server fetches the URL (returns HTML 403) and when the browser fetches it (returns PDF). As a result, Via "detects" the content as HTML but it loads in the browser as a PDF.

curl -I 'https://hcommons.org/app/uploads/sites/1003683/2024/02/DKTZLTV.pdf'
HTTP/2 403
server: awselb/2.0
date: Tue, 19 Mar 2024 06:46:20 GMT
content-type: text/html
content-length: 118

curl -I 'https://hcommons.org/app/uploads/sites/1003683/2024/02/DKTZLTV.pdf' -A 'Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/17.2.1 Safari/605.1.15'
HTTP/2 200
date: Tue, 19 Mar 2024 06:45:53 GMT
content-type: application/pdf
content-length: 396430

@janraev
Copy link

janraev commented Apr 10, 2024

URL: https://time.com/6553085/jo-koy-golden-globes-barbie-joke/

Support ticket: https://app.hubspot.com/contacts/6291320/record/0-5/2591646987

Via behavior: The article displays for a brief moment and the page goes entirely blank

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

5 participants