internetarchive / wayback-machine-webextension

A web browser extension for Chrome, Firefox, Edge, and Safari 14.
GNU Affero General Public License v3.0
647 stars, 207 forks

Archiving url with Russian characters doesn't display PDF #560

Open Melonadev opened 4 years ago

Melonadev commented 4 years ago

Archiving a PDF URL whose domain contains Russian (Cyrillic) characters (see the links below) doesn't display the PDF at all.

Relevant URLs

Archived: https://web.archive.org/web/20200605023059/http://xn--b1aanbebkbbpfqcbebcaoyded7a1etm.xn--p1ai/tribuna/kachesov.pdf

Current (original URL): http://xn--b1aanbebkbbpfqcbebcaoyded7a1etm.xn--p1ai/tribuna/kachesov.pdf or союзмосковскихкомпозиторов.рф/tribuna/kachesov.pdf
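For readers unfamiliar with the two forms above: the `xn--` hostname is the ASCII (Punycode) encoding of the Cyrillic one, as defined by IDNA. The sketch below shows one way to convert between them using Python's built-in `idna` codec (which implements IDNA 2003). The function names `to_ascii_host`, `to_unicode_host`, and `to_ascii_url` are hypothetical and used only for illustration; this is not code from the extension or from web.archive.org, and it says nothing about where the actual bug lies.

```python
from urllib.parse import urlsplit, urlunsplit


def to_ascii_host(host: str) -> str:
    """Return the ASCII (Punycode) form of a hostname.

    Uses Python's built-in "idna" codec (IDNA 2003). Hostnames that are
    already ASCII are returned unchanged.
    """
    return host.encode("idna").decode("ascii")


def to_unicode_host(host: str) -> str:
    """Return the Unicode form of an ASCII (Punycode) hostname."""
    return host.encode("ascii").decode("idna")


def to_ascii_url(url: str) -> str:
    """Rewrite a URL so that its hostname is in ASCII (Punycode) form.

    Simplification for this sketch: any userinfo ("user:pass@") in the
    URL is dropped; scheme, port, path, query, and fragment are kept.
    """
    parts = urlsplit(url)
    host = to_ascii_host(parts.hostname or "")
    netloc = host if parts.port is None else f"{host}:{parts.port}"
    return urlunsplit(
        (parts.scheme, netloc, parts.path, parts.query, parts.fragment)
    )
```

Under these assumptions, calling `to_ascii_url` on the Cyrillic form of the original URL (with an `http://` scheme added) should yield an `xn--` hostname, which can then be compared against the hostname in the archived link.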

To Reproduce

  1. Search for the original URL above in the Wayback Machine.
  2. Go to any snapshot.

Expected behavior: The Wayback Machine displays the actual PDF. Here's a successful example.

Actual behavior: The Wayback Machine displays its own homepage instead of the captured page.

Details

Desktop:

tikhsuP commented 4 years ago

Thank you @Melonadev for reporting this issue.

anishsarangi commented 4 years ago

@vbanos Can you check this, please? Thank you!

cgorringe commented 4 years ago

I can confirm that (1) this is an issue with web.archive.org, and (2) while it is also an issue with the Chrome extension, the extension should work automatically once (1) is fixed.