rajatomar788 / pywebcopy

Locally saves webpages to your hard disk with images, css, js & links as is.
https://rajatomar788.github.io/pywebcopy/

File ext '' is not allowed for file at #3

Closed aviraonepiece closed 5 years ago

aviraonepiece commented 5 years ago

```python
#!/usr/bin/python
# -*- coding: utf-8 -*-

from pywebcopy import WebPage

# Page to save and the folder the project is written to.
url = 'http://baijiahao.baidu.com/s?id=1622464741855375146'
project_loc = 'C:/Users/康康/Desktop'

wp = WebPage(url, project_folder=project_loc, default_encoding=None, HTML=None, bypass_robots=True)

wp.save_complete()
```

I got: CRITICAL - pywebcopy.core.new_file:189 - File ext '' is not allowed for file at 'http://t10.baidu.com/it/u=788283957,2664293696&fm=173&app=49&f=JPEG?w=528&h=446&s=27C8D010495E60CC40E4045A0300C0F2'

But I have set bypass_robots.

rajatomar788 commented 5 years ago

Hey! It's working fine. It's just not recognizing the file type of a dynamically generated image, which is hard to do in practice.

There is no relation between bypass_robots and file type not being recognized.

But I will definitely write a patch in the future with better support for dynamic files.
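To illustrate why the extension comes out empty for this URL, here is a minimal sketch using only the standard library. It assumes the extension is taken from the URL path, which is a guess about how the check works and not a description of pywebcopy's internals. The helper names `ext_from_url` and `ext_from_content_type` are hypothetical, and the `Content-Type` fallback is just one possible approach for dynamic files.

```python
import mimetypes
import os
from urllib.parse import urlparse


def ext_from_url(url):
    """Return the extension found in the URL path, or '' if there is none."""
    # Only the path is inspected; the query string (after '?') is ignored.
    return os.path.splitext(urlparse(url).path)[1]


def ext_from_content_type(content_type):
    """Hypothetical fallback: map a Content-Type header value to an extension."""
    # Drop parameters such as '; charset=...' before the lookup.
    mime = content_type.split(';', 1)[0].strip().lower()
    return mimetypes.guess_extension(mime) or ''


url = ('http://t10.baidu.com/it/u=788283957,2664293696&fm=173&app=49'
       '&f=JPEG?w=528&h=446&s=27C8D010495E60CC40E4045A0300C0F2')

# The path ends in '...&f=JPEG' with no dot, so no extension is found.
print(repr(ext_from_url(url)))  # prints ''

# A server-reported type could still identify the file as an image.
print(ext_from_content_type('image/png'))
```

The `f=JPEG` hint sits inside the path as a parameter, not as a file suffix, so a path-based check has nothing to match. Reading the response's `Content-Type` header would need an extra request per file.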

Thank you.

On Saturday, 12 January 2019, Mr.Fake notifications@github.com wrote:

```python
#!/usr/bin/python
# -*- coding: utf-8 -*-

from pywebcopy import WebPage

url = 'http://baijiahao.baidu.com/s?id=1622464741855375146'
project_loc = 'C:/Users/康康/Desktop'

wp = WebPage(url, project_folder=project_loc, default_encoding=None, HTML=None, bypass_robots=True)
# You can choose to load the page explicitly using requests module if you want complete webpage

wp.save_complete()
```

I got: CRITICAL - pywebcopy.core.new_file:189 - File ext '' is not allowed for file at 'http://t10.baidu.com/it/u=788283957,2664293696&fm=173&app=49&f=JPEG?w=528&h=446&s=27C8D010495E60CC40E4045A0300C0F2'

But I have set bypass_robots.
