serg472 / htmlcompressor

HTML Compressor and Minifier, can be used standalone and as a Java library
Apache License 2.0
40 stars 14 forks source link

Russian utf8 chars breaking #25

Closed GoogleCodeExporter closed 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?
1. compress html that contains russian utf8 char "И" (D0 98)

What is the expected output? What do you see instead?
Hmmm. Expected the same char (D0 98) without any modifications, but (D0 F3) 
appears instead

What version of the product are you using? On what operating system?
win7 x64, Java(TM) SE Runtime Environment (build 1.6.0_22-b04), 
htmlcompressor-0.9.8.jar

Please provide any additional information below.

Original issue reported on code.google.com by a.tsvet...@gmail.com on 6 Jan 2011 at 2:12

GoogleCodeExporter commented 9 years ago
Are you using command line compressor? Please provide your command line 
parameters.

Original comment by serg472@gmail.com on 6 Jan 2011 at 4:59

GoogleCodeExporter commented 9 years ago
Yep, command line:
java -jar htmlcompressor-0.9.8.jar 1.phtml -o 1.1.phtml
UTF8 without BOM

Original comment by a.tsvet...@gmail.com on 6 Jan 2011 at 6:50

GoogleCodeExporter commented 9 years ago
You can specify charset through a parameter:

java -jar htmlcompressor-0.9.8.jar --charset UTF-8 1.phtml -o 1.1.phtml

If this doesn't work please create a txt file with that letter and attach it to 
this thread.

Original comment by serg472@gmail.com on 6 Jan 2011 at 7:09

GoogleCodeExporter commented 9 years ago
Oh, it works. Thnx and sorry, need to read the manual :)

Original comment by a.tsvet...@gmail.com on 6 Jan 2011 at 7:30

GoogleCodeExporter commented 9 years ago
No problem, glad it helped :)

Original comment by serg472@gmail.com on 6 Jan 2011 at 7:50