tfr00 / ExactPDF

Exact Editions PDF Downloader - Shell Script
https://tfr00.github.io/ExactPDF
0 stars 1 forks source link

now output #2

Open beta2k opened 3 years ago

beta2k commented 3 years ago

I use this on windows with Git Bash which should be fine. I enter the issue id from the URL and also past the Cli-Command which looks as follows.

curl --header 'Host: reader.exacteditions.com' --user-agent 'Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:88.0) Gecko/20100101 Firefox/88.0' --header 'Accept: text/html,application/xhtml+xml,application/xml;q=0.9,image/webp,/;q=0.8' --header 'Accept-Language: en-US,en;q=0.5' --referer 'https://reader.exacteditions.com/issues/93776/page/1' --cookie '_ga_VCC34BJ1S3=GS1.1.1621203040.2.1.1621203048.0; _ga=GA1.2.542087124.1621197936; _gid=GA1.2.688189537.1621197936; ee_session=WEl4YTBPU2VHa2wwbzRYUko4WGlmNUMyV0xzYkVMMVoxRHhINzlBbXFadE5qUCsyL3E1bzR4RlU3Qk9nV0g1Zi0telVvSUxSK3BIL1lVY2pZY04rVjRadz09--58403ca48d1246b885eb608b51d599aca528d0ef; _reader_session=K%2BLqhGAfD4pVoE%2Bq5prDPA3uFVUvRxFwMPz6es2n5%2FkUCAFClTBcRA01I2YVLT1%2F9w%2B1AFfKgQYP2Sj4j1U4SZLQ3n4qnZ1lqiUtOdXUBL%2FI7x96sfwqV%2FY0WMLPgnXL7%2Bq2Shau%2F64gdubyslc%3D--Yz5lyWJyLdD6YPmO--SqHxsT2ESMhzeTl9OOPSIA%3D%3D; _gat=1; _gat_publisher=1; _gat_additional0=1' --header 'Upgrade-Insecure-Requests: 1' 'https://reader.exacteditions.com/issues/93776/page/1.pdf' --output 'page_OFC.pdf'

Then the terminal windows shows a lot of gibberish output like this:

image

This goes on for very very long. I wanted to download 100 pages and it took more than an hour. I observed the folder and I saw that a download.txt file gets updated frequently with the download URLs for the PDFs. but in the end, nothing seems to be stored.

Could it be the case, that the binary PDF output is piped to the console output of the Git Bash terminal (hence the gibberish in the output)? If yes, is there some way to get this to work under windows?

tfr00 commented 3 years ago

I'm not familiar with Git Bash in detail. "Could it be the case, that the binary PDF output is piped to the console output of the Git Bash terminal (hence the gibberish in the output)?" That is what I think of too. Please check if there are some limits to Curl command on the Git Bash; I know of some restrictions with it (e.g. entering login credentials might cause problems). Maybe try the Windows Subsystem for Linux, on the WSL the Curl command and file operations will work as intended.

beta2k commented 3 years ago

Thanks, will try. Do you know if there is a way to remove the "watermark" from the downloaded pdfs? On the bottom of the PDF I see my name (because I am the subscriber).

tfr00 commented 3 years ago

Please give it a try. As this is only for private usage, I never thought of removing the name / watermarks. But I'm sure there is a solution for this type of problem also.

giampi75 commented 2 years ago

Sorry, I'm not very expert. I try with Git Bash, but when I tape Enter after I past the Cli-Command the command window disappear and nothing happens