The wikicreole parser currently emit too many B.phrasing elements, eventually splitting words into pieces.
For example, with input the, the parser emits B.phrasing for t and he. The reason seems to be that there is a rule for parsing http:..., the parser stops after t in case the h is the start of http:....
The other aspect that is weird to me is that tyxml can generate files with different browser rendering with and without indent.
I think it boills down to the following : printing [ pcdata "a"; pcdata "b"] inserts a cutting hint between 'a' and 'b'. Format can decide to insert newlines if the text is too long. The browser renders "a\nb" and "ab" differently.
The wikicreole parser currently emit too many
B.phrasing
elements, eventually splitting words into pieces.For example, with input
the
, the parser emitsB.phrasing
fort
andhe
. The reason seems to be that there is a rule for parsinghttp:...
, the parser stops aftert
in case theh
is the start ofhttp:...
.The other aspect that is weird to me is that tyxml can generate files with different browser rendering with and without
indent
. I think it boills down to the following : printing[ pcdata "a"; pcdata "b"]
inserts a cutting hint between 'a' and 'b'.Format
can decide to insert newlines if the text is too long. The browser renders "a\nb" and "ab" differently.