Open hossman opened 7 years ago
I feel like we'd have to look at them all anyway to be sure we got them right. There is a short list of pages with this problem - they are output from the ant build-jekyll
target as warnings. A quick count shows maybe 15 pages?
yeah, maybe manual audit/cleanup is easiest ... i just wanted to point out there is a (fairly straight forward) automated solution to this problem we could consider.
unlike confluence, which was happy to let us have a page start with an
h3
, and/or have pages with section headings usingh2
followed by subsections usingh4
, asciidoctor generally frowns on this and gives lots of warnings because of it.If we want to try to clean this up, then doing it as part of the
ScrapeConfluence.java
HTML cleanup code we already have (when doing our HTML cleanup on the cwiki export) would probably be the most straight forward place to do it ... otherwise i think we'd have to manually cleanup the adoc files (so```me creative grepping would at least let us quickly scan files visually looking for discrepencies)If we want to do this in conversion code, then what i think would work pretty easily is something like the following psuedo code...
(NOTE: might be an off by one error there, can't remember if the html->adoc conversion assumes/expects that we won't use any "h1" tags in the body of pages)