pts / pdfsizeopt

PDF file size optimizer
GNU General Public License v2.0
764 stars 66 forks source link

Input PDF on which using Multivalent makes the output much smaller #53

Open rbrito opened 7 years ago

rbrito commented 7 years ago

Dear @pts,

You previously asked to send you files where using Multivalent made some significant change in terms of size and I just found one where it does, considerably.

I will send you the file privately, but here is the output of sizes of the processed files:

$ ls -lgo *.pdf
-rw-r--r-- 1 15215702 Oct  5 15:09 numerical-analysis.pdf
-rw-r--r-- 1 12421574 Oct 10 16:40 numerical-analysis.pso.pdf
-rw-r--r-- 1  7553461 Oct 10 16:40 numerical-analysis.psom.pdf

Note that the file with Multivalent is about half the size of the one generated by pdfsizeopt alone, which is a bit unusual nowadays (files with only pdfsizeopt have started "winning" in terms of size, which is a testament to the quality of your tool!).

Thanks,

Rogério.

pts commented 7 years ago

Thank you! Please share more files like this as you find them! They highlight an improvement (extension) opportunity for pdfsizeopt.

Please note that it's impossible for pdfsizeopt (without Multivalent) to win over pdfsizeopt+Multivalent, because the optimizations done by Multivalent don't increase the file size. Thus it can only happen that there is very little additional gain of running Multivalent after pdfsizeopt, but there won't be an blowup. (There may be some occasional exceptions, it would be interesting to find such PDF files.)

This white paper mentions some optimizations Multivalent does: http://multivalent.sourceforge.net/Research/TwoDietPlans.pdf :

pts commented 7 years ago

Oddly enough, Multivalent is very slow (slower than 10 minutes) for me on numerical-analysis.pdf, possibly it's running in an infinite loop. Could you please attach the console output of time ../pdfsizeopt --use-multivalent=yes --use-pngout=no numerical-analysis.pdf? Please also send me your output file numerical-analysis.psom.pdf. Can you notice any visual difference?

pts commented 7 years ago

I managed to run Multivalent on numerical-analysis.pdf, and I can reproduce your results. No need to send any files.