pycontw / pycon_archive_past_website

Freeze and archive historical PyConTW official websites as static sites.
MIT License

[Bug Report] Web Crawler Bugs in Archiving #45

Open SivanYeh opened 2 weeks ago

SivanYeh commented 2 weeks ago

Describe the bug: In 2024 we found that, among the archived websites of past years, the images on the 2021 site are broken. See PR #44 for details.

To Reproduce:

  1. Go to "https://tw.pycon.org/2021/" after archiving
  2. See the error: images are missing on pages (e.g., speaker photos are missing at "https://tw.pycon.org/2021/en-us/conference/talks")

josix commented 1 week ago

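Additional context: the websites from 2021 onward are static pages produced by nuxt generate. At build time they targeted the image paths served by the temp server's API, so once the temp server was removed in 2024 those paths were lost, which causes this problem.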
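One way to confirm which archived pages are affected might be to list every `<img>` whose `src` points outside the archive host. The sketch below is only an illustration using the Python standard library; the function name `external_image_sources` and the host `temp.example.org` are placeholders, not names taken from this repository or from the real temp server.

```python
from html.parser import HTMLParser
from urllib.parse import urlparse


class ImgSrcCollector(HTMLParser):
    """Collect the src attribute of every <img> tag in a document."""

    def __init__(self):
        super().__init__()
        self.sources = []

    def handle_starttag(self, tag, attrs):
        if tag == "img":
            for name, value in attrs:
                if name == "src" and value:
                    self.sources.append(value)


def external_image_sources(html, archive_host="tw.pycon.org"):
    """Return img src values whose host differs from archive_host.

    Relative URLs carry no host and are treated as local to the archive.
    """
    collector = ImgSrcCollector()
    collector.feed(html)
    return [
        src
        for src in collector.sources
        if urlparse(src).netloc not in ("", archive_host)
    ]


# Hypothetical page fragment: one local image, one on a placeholder temp host.
sample = (
    '<img src="/2021/static/logo.png">'
    '<img src="https://temp.example.org/media/speaker.png">'
    '<img src="https://tw.pycon.org/2021/banner.png">'
)
print(external_image_sources(sample))
```

Running this over the generated HTML of the 2021 site would give a list of image URLs that need to be downloaded into the archive or rewritten before the pages can be considered frozen.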