tmtsoftware / icd

ICD - Interface Control Document Management
Other
12 stars 5 forks source link

Validate the contents of description fields, checking that they can generate valid html and pdf #28

Closed abrighton closed 8 years ago

abrighton commented 8 years ago

The contents of description (and other) fields can contain Mardown and/or HTML markup (Markdown allows embedded HTML). Browsers are lenient with poorly formatted HTML, but the PDF generator is not and requires balanced elements (XHTML style). The icd validator should also validate the contents of these fields to make sure there are no errors later when generating PDF documents.

abrighton commented 8 years ago

I checked in a fix that avoids having to validate the HTML. The code now uses "jsoup" to sanitize the HTML, turning old style HTML it into valid (and more secure) XHTML, as needed by the pdf generating code.

I also added code to strip leading white space from the markdown input. If you want a literal block, you will have to use the triple back quotes.