jgm / pandoc

Universal markup converter
https://pandoc.org
Other
34.03k stars 3.35k forks source link

Convert Markdown to Docbook (different outputs) #1328

Closed dorneanu closed 10 years ago

dorneanu commented 10 years ago

Hi there! First I'd like to thank you for such a lovely software! And now my problem:

I'm trying to convert following markdown code into docbook:

# Summary

## Host

http://google.de

# Vulnerabilities

## Cross Site Scripting

### Description

Lorem ipsum dolor sit amet, consectetur adipisicing elit, sed do eiusmod
tempor incididunt ut labore et dolore magna aliqua. Ut enim ad minim veniam,
quis nostrud exercitation ullamco laboris nisi ut aliquip ex ea commodo
consequat. Duis aute irure dolor in reprehenderit in voluptate velit esse
cillum dolore eu fugiat nulla pariatur. Excepteur sint occaecat cupidatat non
proident, sunt in culpa qui officia deserunt mollit anim id est laborum.

### Target

http://www.google.com

### Severity

Medium

### PoC

* Input

    ~~~
    <HTML><BODY>
    <?xml:namespace prefix="t" ns="urn:schemas-microsoft-com:time">
    <?import namespace="t" implementation="#default#time2">
    <t:set attributeName="innerHTML" to="XSS<SCRIPT DEFER>alert("XSS")</SCRIPT>">
    </BODY></HTML>
    ~~~

* Output

    ~~~
    <link href="/stylesheets/screen.css" media="screen, projection" rel="stylesheet" type="text/css">

    <!--Fonts from Google"s Web font directory at http://google.com/webfonts -->
    <link href='http://fonts.googleapis.com/css?family=Open+Sans' rel='stylesheet' type='text/css'>
    <link href='http://fonts.googleapis.com/css?family=Jura' rel='stylesheet' type='

Using http://johnmacfarlane.net/pandoc/try/ I get (as expected):

<para>
  `# Summary
</para>
<sect1 id="host">
  <title>Host</title>
  <para>
    http://google.de
  </para>
</sect1>
<sect1 id="vulnerabilities">
  <title>Vulnerabilities</title>
  <sect2 id="cross-site-scripting">
    <title>Cross Site Scripting</title>
    <sect3 id="description">
      <title>Description</title>
      <para>
        Lorem ipsum dolor sit amet, consectetur adipisicing elit, sed do
        eiusmod tempor incididunt ut labore et dolore magna aliqua. Ut
        enim ad minim veniam, quis nostrud exercitation ullamco laboris
        nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor
        in reprehenderit in voluptate velit esse cillum dolore eu fugiat
        nulla pariatur. Excepteur sint occaecat cupidatat non proident,
        sunt in culpa qui officia deserunt mollit anim id est laborum.
      </para>
    </sect3>
    <sect3 id="target">
      <title>Target</title>
      <para>
        http://www.google.com
      </para>
    </sect3>
    <sect3 id="severity">
      <title>Severity</title>
      <para>
        Medium
      </para>
    </sect3>
    <sect3 id="poc">
      <title>PoC</title>
      <itemizedlist>
        <listitem>
          <para>
            Input
          </para>
          <programlisting>
&lt;HTML&gt;&lt;BODY&gt;
&lt;?xml:namespace prefix=&quot;t&quot; ns=&quot;urn:schemas-microsoft-com:time&quot;&gt;
&lt;?import namespace=&quot;t&quot; implementation=&quot;#default#time2&quot;&gt;
&lt;t:set attributeName=&quot;innerHTML&quot; to=&quot;XSS&lt;SCRIPT DEFER&gt;alert(&quot;XSS&quot;)&lt;/SCRIPT&gt;&quot;&gt;
&lt;/BODY&gt;&lt;/HTML&gt;
</programlisting>
        </listitem>
        <listitem>
          <para>
            Output
          </para>
          <para>
            <subscript>~</subscript>
            <link href="/stylesheets/screen.css" media="screen, projection" rel="stylesheet" type="text/css">
          </para>
          <!--Fonts from Google"s Web font directory at http://google.com/webfonts -->
          <para>
            <link href='http://fonts.googleapis.com/css?family=Open+Sans' rel='stylesheet' type='text/css'>
            &lt;link
            href=<quote>http://fonts.googleapis.com/css?family=Jura</quote>
            rel=<quote>stylesheet</quote> type=’
          </para>
        </listitem>
      </itemizedlist>
    </sect3>
  </sect2>
</sect1>

However when I try to convert the same code on my Windows 7 machine I get:

<?xml version="1.0" encoding="utf-8" ?>
<!DOCTYPE article PUBLIC "-//OASIS//DTD DocBook XML V4.5//EN"
                  "http://www.oasis-open.org/docbook/xml/4.5/docbookx.dtd">
<article>
  <articleinfo>
    <title></title>
  </articleinfo>
# Summary ## Host http://google.de # Vulnerabilities ## Cross Site
Scripting ### Description Lorem ipsum dolor sit amet, consectetur
adipisicing elit, sed do eiusmod tempor incididunt ut labore et dolore
magna aliqua. Ut enim ad minim veniam, quis nostrud exercitation ullamco
laboris nisi ut aliquip ex ea commodo consequat. Duis aute irure dolor
in reprehenderit in voluptate velit esse cillum dolore eu fugiat nulla
pariatur. Excepteur sint occaecat cupidatat non proident, sunt in culpa
qui officia deserunt mollit anim id est laborum. ### Target
http://www.google.com ### Severity Medium ### PoC * Input ~~~
&quot;&gt;
~~~ * Output ~~~
</article>

Here are some system information:

bash-3.1$ pandoc -v
pandoc.exe 1.12.4.2
Compiled with texmath 0.6.6.1, highlighting-kate 0.5.8.2.
Syntax highlighting is supported for the following languages:
    actionscript, ada, apache, asn1, asp, awk, bash, bibtex, boo, c, changelog,
    clojure, cmake, coffee, coldfusion, commonlisp, cpp, cs, css, curry, d,
    diff, djangotemplate, doxygen, doxygenlua, dtd, eiffel, email, erlang,
    fortran, fsharp, gcc, gnuassembler, go, haskell, haxe, html, ini, isocpp,
    java, javadoc, javascript, json, jsp, julia, latex, lex, literatecurry,
    literatehaskell, lua, makefile, mandoc, markdown, matlab, maxima, metafont,
    mips, modelines, modula2, modula3, monobasic, nasm, noweb, objectivec,
    objectivecpp, ocaml, octave, pascal, perl, php, pike, postscript, prolog,
    python, r, relaxngcompact, restructuredtext, rhtml, roff, ruby, rust, scala,
    scheme, sci, sed, sgml, sql, sqlmysql, sqlpostgresql, tcl, texinfo, verilog,
    vhdl, xml, xorg, xslt, xul, yacc, yaml
Default user data directory: C:\pandoc
Copyright (C) 2006-2014 John MacFarlane
Web:  http://johnmacfarlane.net/pandoc
This is free software; see the source for copying conditions.  There is no
warranty, not even for merchantability or fitness for a particular purpose.
bash-3.1$

Any ideas? Thanks in advance.

Best regards,

Victor

mpickering commented 10 years ago

What command are you running?

dorneanu commented 10 years ago

I was using:

pandoc -s -S -t docbook test.markdown -o test.xml

On 10. Juni 2014 21:17:13 MESZ, mpickering notifications@github.com wrote:

What command are you running?


Reply to this email directly or view it on GitHub: https://github.com/jgm/pandoc/issues/1328#issuecomment-45658834

Sent from my phone using my brain.

mpickering commented 10 years ago

I can't seem to reproduce this bug. I am using pandoc 1.12.4.2 and the file converts as expected.

dorneanu commented 10 years ago

Hmm..That's strange. I can now convert the file as expected. Currently I'm using git-bash and I run:

$ pandoc -s -S -t docbook test2.markdown -o example.xml

And that works. Previously I've used:

$ pandoc -s -S -t docbook /c/tmp/test2.markdown -o example.xml

... and that didn't work. Using Windows standard cmd also worked:

C:\tmp>pandoc -s -S -t docbook c:\tmp\test2.markdown -o example.xml

Anyway: Thanks for your help! I'll close this issue.