Open pyprism opened 1 year ago
Describe the bug Sometimes wayback saving google ad or facebook/twitter plugin urls. If I tried to re-save again, sometimes it succeeded to save the real url, sometimes it returned previous ad/plugin URLs. I am using version 3.0.6 on linux.
some sample URLs:
returned URL: https://web.archive.org/web/20211228103331/https://googleads.g.doubleclick.net/pagead/ads?client=ca-pub-3924910931647576&output=html&adk=1812271804&adf=3025194257&lmt=1640687592&plat=3%3A32%2C4%3A32%2C9%3A32768%2C16%3A8388608%2C17%3A32%2C24%3A32%2C25%3A32%2C30%3A1048576%2C32%3A32&format=0x0&url=https%3A%2F%2Fwww.ittefaq.com.bd%2F307890%2F%25E2%2580%2598%25E0%25A6%25B7%25E0%25A7%259C%25E0%25A6%25AF%25E0%25A6%25A8%25E0%25A7%258D%25E0%25A6%25A4%25E0%25A7%258D%25E0%25A6%25B0-%25E0%25A6%2595%25E0%25A6%25B0%25E0%25A7%2587-%25E0%25A6%2586%25E0%25A6%2593%25E0%25A7%259F%25E0%25A6%25BE%25E0%25A6%25AE%25E0%25A7%2580-%25E0%25A6%25B2%25E0%25A7%2580%25E0%25A6%2597%25E0%25A7%2587%25E0%25A6%25B0-%25E0%25A6%2585%25E0%25A6%2597%25E0%25A7%258D%25E0%25A6%25B0%25E0%25A6%25AF%25E0%25A6%25BE%25E0%25A6%25A4%25E0%25A7%258D%25E0%25A6%25B0%25E0%25A6%25BE-%25E0%25A6%25AC%25E0%25A7%258D%25E0%25A6%25AF%25E0%25A6%25BE%25E0%25A6%25B9%25E0%25A6%25A4-%25E0%25A6%2595%25E0%25A6%25B0%25E0%25A6%25BE&ea=0&flash=0&pra=5&wgl=1&dt=1640687598320&bpp=9&bdt=5517&idt=12669&shv=r20211207&mjsv=m202112060101&ptt=9&saldr=aa&abxe=1&nras=1&correlator=2609084653866&frm=20&pv=2&ga_vid=1020869695.1640687611&ga_sid=1640687611&ga_hid=2057842903&ga_fc=0&u_tz=0&u_his=50&u_h=1000&u_w=1600&u_ah=1000&u_aw=1600&u_cd=24&u_sd=1&dmc=8&adx=-12245933&ady=-12245933&biw=1085&bih=13431&scr_x=0&scr_y=0&eid=44750774%2C31063858&oid=2&pvsid=1561880799211149&pem=304&tmod=686&eae=2&fc=1920&brdim=10%2C10%2C10%2C10%2C1600%2C0%2C1100%2C900%2C1085%2C13431&vis=1&rsz=%7C%7Cs%7C&abl=NS&fu=32768&bc=31&ifi=1&uci=a!1&fsb=1&dtd=12726
main url : https://www.banglatribune.com/708433/%E0%A7%AE%E0%A7%AA-%E0%A6%B0%E0%A6%BE%E0%A6%A8%E0%A7%87%E0%A6%B0-%E0%A6%AC%E0%A7%9C-%E0%A6%9C%E0%A7%9F%E0%A7%87-%E0%A6%B8%E0%A7%81%E0%A6%AA%E0%A6%BE%E0%A6%B0-%E2%80%8D%E0%A6%9F%E0%A7%81%E0%A7%9F%E0%A7%87%E0%A6%B2%E0%A6%AD%E0%A7%87-%E0%A6%AC%E0%A6%BE%E0%A6%82%E0%A6%B2%E0%A6%BE%E0%A6%A6%E0%A7%87%E0%A6%B6 returned url: https://web.archive.org/web/20211021135016/https://platform.twitter.com/widgets/widget_iframe.a53eecb4584348a2ad32ec2ae21f6eae.html?origin=https%3A%2F%2Fwww.banglatribune.com
main url: https://www.ittefaq.com.bd/307148/%E0%A6%AD%E0%A6%BE%E0%A6%B0%E0%A6%A4%E0%A6%95%E0%A7%87-%E0%A6%B9%E0%A6%BE%E0%A6%B0%E0%A6%BF%E0%A7%9F%E0%A7%87-%E0%A6%9A%E0%A7%8D%E0%A6%AF%E0%A6%BE%E0%A6%AE%E0%A7%8D%E0%A6%AA%E0%A6%BF%E0%A7%9F%E0%A6%A8-%E0%A6%AC%E0%A6%BE%E0%A6%82%E0%A6%B2%E0%A6%BE%E0%A6%A6%E0%A7%87%E0%A6%B6%E0%A7%87%E0%A6%B0-%E0%A6%AE%E0%A7%87%E0%A7%9F%E0%A7%87%E0%A6%B0%E0%A6%BE returned url : https://web.archive.org/web/20211222161510/https://www.facebook.com/plugins/feedback.php?app_id=291494419107518&channel=https%3A%2F%2Fstaticxx.facebook.com%2Fx%2Fconnect%2Fxd_arbiter%2F%3Fversion%3D46%23cb%3Df18ef94141bd42%26domain%3Dwww.ittefaq.com.bd%26is_canvas%3Dfalse%26origin%3Dhttps%253A%252F%252Fwww.ittefaq.com.bd%252Ff1f7b27adffda28%26relation%3Dparent.parent&container_width=571&height=100&href=https%3A%2F%2Fwww.ittefaq.com.bd%2F307052%2F%25E0%25A6%258F%25E0%25A6%2595-%25E0%25A6%25AE%25E0%25A7%258D%25E0%25A6%25AF%25E0%25A6%25BE%25E0%25A6%259A-%25E0%25A6%25AA%25E0%25A6%25B0-%25E0%25A6%2586%25E0%25A6%25AC%25E0%25A6%25BE%25E0%25A6%25B0%25E0%25A6%2593-%25E0%25A6%25AC%25E0%25A6%25BE%25E0%25A6%25B0%25E0%25A7%258D%25E0%25A6%25B8%25E0%25A7%2587%25E0%25A6%25B2%25E0%25A7%258B%25E0%25A6%25A8%25E0%25A6%25BE%25E0%25A6%25B0-%25E0%25A6%25B9%25E0%25A7%258B%25E0%25A6%259A%25E0%25A6%259F&locale=en_US&numposts=5&sdk=joey&version=v12.0&width=550
Describe the bug Sometimes wayback saving google ad or facebook/twitter plugin urls. If I tried to re-save again, sometimes it succeeded to save the real url, sometimes it returned previous ad/plugin URLs. I am using version 3.0.6 on linux.
some sample URLs:
returned URL: https://web.archive.org/web/20211228103331/https://googleads.g.doubleclick.net/pagead/ads?client=ca-pub-3924910931647576&output=html&adk=1812271804&adf=3025194257&lmt=1640687592&plat=3%3A32%2C4%3A32%2C9%3A32768%2C16%3A8388608%2C17%3A32%2C24%3A32%2C25%3A32%2C30%3A1048576%2C32%3A32&format=0x0&url=https%3A%2F%2Fwww.ittefaq.com.bd%2F307890%2F%25E2%2580%2598%25E0%25A6%25B7%25E0%25A7%259C%25E0%25A6%25AF%25E0%25A6%25A8%25E0%25A7%258D%25E0%25A6%25A4%25E0%25A7%258D%25E0%25A6%25B0-%25E0%25A6%2595%25E0%25A6%25B0%25E0%25A7%2587-%25E0%25A6%2586%25E0%25A6%2593%25E0%25A7%259F%25E0%25A6%25BE%25E0%25A6%25AE%25E0%25A7%2580-%25E0%25A6%25B2%25E0%25A7%2580%25E0%25A6%2597%25E0%25A7%2587%25E0%25A6%25B0-%25E0%25A6%2585%25E0%25A6%2597%25E0%25A7%258D%25E0%25A6%25B0%25E0%25A6%25AF%25E0%25A6%25BE%25E0%25A6%25A4%25E0%25A7%258D%25E0%25A6%25B0%25E0%25A6%25BE-%25E0%25A6%25AC%25E0%25A7%258D%25E0%25A6%25AF%25E0%25A6%25BE%25E0%25A6%25B9%25E0%25A6%25A4-%25E0%25A6%2595%25E0%25A6%25B0%25E0%25A6%25BE&ea=0&flash=0&pra=5&wgl=1&dt=1640687598320&bpp=9&bdt=5517&idt=12669&shv=r20211207&mjsv=m202112060101&ptt=9&saldr=aa&abxe=1&nras=1&correlator=2609084653866&frm=20&pv=2&ga_vid=1020869695.1640687611&ga_sid=1640687611&ga_hid=2057842903&ga_fc=0&u_tz=0&u_his=50&u_h=1000&u_w=1600&u_ah=1000&u_aw=1600&u_cd=24&u_sd=1&dmc=8&adx=-12245933&ady=-12245933&biw=1085&bih=13431&scr_x=0&scr_y=0&eid=44750774%2C31063858&oid=2&pvsid=1561880799211149&pem=304&tmod=686&eae=2&fc=1920&brdim=10%2C10%2C10%2C10%2C1600%2C0%2C1100%2C900%2C1085%2C13431&vis=1&rsz=%7C%7Cs%7C&abl=NS&fu=32768&bc=31&ifi=1&uci=a!1&fsb=1&dtd=12726
main url : https://www.banglatribune.com/708433/%E0%A7%AE%E0%A7%AA-%E0%A6%B0%E0%A6%BE%E0%A6%A8%E0%A7%87%E0%A6%B0-%E0%A6%AC%E0%A7%9C-%E0%A6%9C%E0%A7%9F%E0%A7%87-%E0%A6%B8%E0%A7%81%E0%A6%AA%E0%A6%BE%E0%A6%B0-%E2%80%8D%E0%A6%9F%E0%A7%81%E0%A7%9F%E0%A7%87%E0%A6%B2%E0%A6%AD%E0%A7%87-%E0%A6%AC%E0%A6%BE%E0%A6%82%E0%A6%B2%E0%A6%BE%E0%A6%A6%E0%A7%87%E0%A6%B6 returned url: https://web.archive.org/web/20211021135016/https://platform.twitter.com/widgets/widget_iframe.a53eecb4584348a2ad32ec2ae21f6eae.html?origin=https%3A%2F%2Fwww.banglatribune.com
main url: https://www.ittefaq.com.bd/307148/%E0%A6%AD%E0%A6%BE%E0%A6%B0%E0%A6%A4%E0%A6%95%E0%A7%87-%E0%A6%B9%E0%A6%BE%E0%A6%B0%E0%A6%BF%E0%A7%9F%E0%A7%87-%E0%A6%9A%E0%A7%8D%E0%A6%AF%E0%A6%BE%E0%A6%AE%E0%A7%8D%E0%A6%AA%E0%A6%BF%E0%A7%9F%E0%A6%A8-%E0%A6%AC%E0%A6%BE%E0%A6%82%E0%A6%B2%E0%A6%BE%E0%A6%A6%E0%A7%87%E0%A6%B6%E0%A7%87%E0%A6%B0-%E0%A6%AE%E0%A7%87%E0%A7%9F%E0%A7%87%E0%A6%B0%E0%A6%BE returned url : https://web.archive.org/web/20211222161510/https://www.facebook.com/plugins/feedback.php?app_id=291494419107518&channel=https%3A%2F%2Fstaticxx.facebook.com%2Fx%2Fconnect%2Fxd_arbiter%2F%3Fversion%3D46%23cb%3Df18ef94141bd42%26domain%3Dwww.ittefaq.com.bd%26is_canvas%3Dfalse%26origin%3Dhttps%253A%252F%252Fwww.ittefaq.com.bd%252Ff1f7b27adffda28%26relation%3Dparent.parent&container_width=571&height=100&href=https%3A%2F%2Fwww.ittefaq.com.bd%2F307052%2F%25E0%25A6%258F%25E0%25A6%2595-%25E0%25A6%25AE%25E0%25A7%258D%25E0%25A6%25AF%25E0%25A6%25BE%25E0%25A6%259A-%25E0%25A6%25AA%25E0%25A6%25B0-%25E0%25A6%2586%25E0%25A6%25AC%25E0%25A6%25BE%25E0%25A6%25B0%25E0%25A6%2593-%25E0%25A6%25AC%25E0%25A6%25BE%25E0%25A6%25B0%25E0%25A7%258D%25E0%25A6%25B8%25E0%25A7%2587%25E0%25A6%25B2%25E0%25A7%258B%25E0%25A6%25A8%25E0%25A6%25BE%25E0%25A6%25B0-%25E0%25A6%25B9%25E0%25A7%258B%25E0%25A6%259A%25E0%25A6%259F&locale=en_US&numposts=5&sdk=joey&version=v12.0&width=550