teambob / docfrac

DocFrac is a document converter that converts between RTF, HTML and ASCII text, including RTF to HTML and HTML to RTF. It supports text formatting (e.g. bold), tables, and most European languages. Available for Windows and Linux, and as ActiveX and DLL.
GNU General Public License v2.0

[Feature] conversion to HTML fails to change consecutive spaces   [sf#8] #52

Open teambob opened 9 years ago

teambob commented 9 years ago

Reported by anonymous on 2005-05-20 09:27 UTC

If the input is indented, the output HTML looks like this:-

<HTML> <BODY> This <br> is <br> indented <br> here <br></BODY> </HTML>

This is wrong. It should look like this:-

<HTML> <BODY> This <br>&nbsp;is <br>&nbsp;&nbsp;indented <br>&nbsp;&nbsp;&nbsp;here <br></BODY> </HTML>

teambob commented 9 years ago

Commented by andrewpunch on 2005-05-23 06:23 UTC Logged In: YES user_id=928005

Multiple spaces must be encoded as &nbsp;

See: http://www.w3.org/TR/html4/struct/text.html#h-9.1

"In particular, user agents should collapse input white space sequences when producing output inter-word space."

That is, a browser treats multiple whitespace characters in a row as a single space, so the extra spaces are lost unless they are encoded.
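
As an illustration of the encoding described above, here is a minimal Python sketch. It is not DocFrac's code: the function names `preserve_spaces` and `text_to_html_body` are hypothetical, and the choice to keep one ordinary space in each interior run (so the browser can still wrap lines) is an assumption rather than something the report specifies.

```python
import html
import re

NBSP = "&nbsp;"


def preserve_spaces(line: str) -> str:
    """Encode the spaces in one line so an HTML renderer does not collapse them.

    Hypothetical helper for illustration only.
    """
    # Leading spaces would be dropped entirely, so encode every one of them.
    stripped = line.lstrip(" ")
    indent = NBSP * (len(line) - len(stripped))

    # Escape &, < and > first so the entities added below are not double-escaped.
    body = html.escape(stripped, quote=False)

    # Assumption: in a run of two or more spaces, keep one real space as a
    # line-break opportunity and encode the others.
    body = re.sub(r" {2,}", lambda m: NBSP * (len(m.group()) - 1) + " ", body)
    return indent + body


def text_to_html_body(text: str) -> str:
    """Convert plain text to the <br>-separated body used in the report."""
    return "".join(preserve_spaces(line) + " <br>" for line in text.split("\n"))


if __name__ == "__main__":
    sample = "This\n is\n  indented\n   here"
    print(text_to_html_body(sample))
```

With the indented sample from the original report, this produces the body the reporter expected: each leading space becomes one `&nbsp;`.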

teambob commented 9 years ago

Commented by andrewpunch on 2005-05-23 06:29 UTC Logged In: YES user_id=928005

Will be fixed in the next version.

teambob commented 9 years ago

Commented by andrewpunch on 2006-10-16 01:38 UTC Logged In: YES user_id=928005

Fixed. The RTF reader does not support indentation, so I will move this item to a feature request.

teambob commented 9 years ago

Updated by andrewpunch on 2006-10-16 01:38 UTC