omkarcloud / botasaurus

The All in One Framework to build Awesome Scrapers.
https://www.omkar.cloud/botasaurus/
MIT License
1.16k stars 104 forks source link

When downloading a binary file like pdf the content is some kind of unicode encoded and cannot be decoded. #45

Closed andrezaiats closed 4 months ago

andrezaiats commented 5 months ago

Do you are aware about this? I cannot download a pdf from a remote site. The headers shows the correct content-length but the content always get bigger (some kind of unicode encoded) but I cannot figure out how to correctly decode it.

Is it possible to correct download a remote pdf from a cloudflare protected site? I guess this happens with all kind of remote binary files...

andrezaiats commented 5 months ago

Still no clue? I haven't figured it out yet...

babynew commented 4 months ago

Can you please share the code ?

Chetan11-dev commented 4 months ago

We do not provide dedicated support for individual problems. We recommend creating a detailed issue on Stack Overflow or in the /r/webscraping/ subreddit on Reddit, where the community can assist you. We hope you understand.