ManifoldScholar / manifold

Transforming scholarly publications into living digital works.
http://manifoldapp.org
GNU General Public License v3.0
236 stars 30 forks source link

Investigate why some word styles aren't allowed through during ingestion #1189

Closed zdavis closed 6 years ago

zdavis commented 6 years ago

word-tester.docx

Some of these styles didn't make it through:

I want to see what the intermediary pandoc HTML looks like, and determine whether it's pandoc that's bungling the conversion, or manifold.

SMaxOwok commented 6 years ago
<!DOCTYPE html PUBLIC \"-//W3C//DTD XHTML 1.0 Transitional//EN\" \"http://www.w3.org/TR/xhtml1/DTD/xhtml1-transitional.dtd\">
<html xmlns=\"http://www.w3.org/1999/xhtml\">
<head>
  <meta http-equiv=\"Content-Type\" content=\"text/html; charset=utf-8\">
  <meta http-equiv=\"Content-Style-Type\" content=\"text/css\">
  <meta name=\"generator\" content=\"pandoc\">
  <meta name=\"author\" content=\"Author Name (Styled Author—Does not render)\">
  <title>Title</title>
  <style type=\"text/css\">code{white-space: pre;}</style>
<style type=\"text/css\">
    .smallcaps { font-variant: small-caps; }
    .underline { text-decoration: underline; }
    .column{ display: inline-block; vertical-align: top; width: 50%; }
    .title { text-align: center; font-size: 2em; }
    .subtitle { text-align: center; font-size: 1.2em; }
    .author { display: none; }
    .date { display: none; }
    tr.header { border-bottom: 1px solid black; }
    tr.even { background-color: #f7f7f7; }
    tr.odd { background-color: #FFF; }
    .footnotes ol li { font-size: 0.8em; }
    .footnote-ref {}
    .footnote-back {}
</style>
</head>
<body>
<div id=\"header\">
<h1 class=\"title\">Title</h1>
<h1 class=\"subtitle\">Subtitle</h1>
<h2 class=\"author\">Author Name <strong>(Styled Author—Does not render)</strong>
</h2>
<h3 class=\"date\">Date: 10 July 10, 2018 <strong>(Styled Date—Does not render)</strong>
</h3>
</div>
<p>Normal paragraph (no space above). <strong>Bold text</strong>. <em>Italic text</em>. <em>Underline text</em>. <em><strong>Bold/italic</strong></em>. <strong><em>Bold/underline.</em></strong> <em><em>Italic/underline</em></em>. <span style=\"font-variant: small-caps;\">Small caps (S capped)</span>. Lorem ipsum dolor sit amet, consectetur adipiscing elit.<sup>superscript</sup> Sed imperdiet dictum sapien ac elementum.<sub>subscript</sub> Morbi leo magna, suscipit in nibh in, dignissim cursus lacus. Curabitur tempor velit erat, eu mattis ex volutpat in. Sed at erat fermentum, ultricies augue ac, tempus risus. Sed lacus sem, blandit eget purus hendrerit, sodales consequat neque. Nulla posuere, magna quis iaculis laoreet, urna sapien pellentesque elit, hendrerit cursus lectus ex non elit. Suspendisse potenti. Praesent posuere justo sit amet finibus tincidunt. Nullam at dictum ante, a mollis purus.<a href=\"#fn1\" class=\"footnoteRef\" id=\"fnref1\"><sup>1</sup></a> Footnote Reference style for note number.</p>
<h1 id=\"heading-1\">Heading 1</h1>
<p>First Paragraph. Duis maximus in metus ac molestie (red text). Integer aliquam sit amet sapien nec posuere. Integer pharetra tortor quis suscipit placerat. Curabitur rutrum consequat sapien, vel scelerisque ipsum maximus non. Morbi diam metus, mattis sit amet orci in, mollis laoreet quam. Aliquam dictum dui in leo bibendum dignissim vel pretium lorem. Vivamus sed finibus lacus.</p>
<p>Body Text paragraph (no space above or manual indent). Vestibulum vehicula, ante sed dignissim euismod, turpis leo laoreet mauris, in vehicula elit ante eget erat. Nullam quis nisi dolor.</p>
<p>Normal paragraph. Tabbed indent. Curabitur quis orci et odio dapibus suscipit quis sit amet mi. Mauris tincidunt lectus varius est vestibulum egestas. Sed accumsan euismod libero, at sollcitudin turpis auctor eu. Aliquam viverra lacus vel leo accumsan sodales.</p>
<h2 id=\"heading-2\">Heading 2</h2>
<p>Mauris at pulvinar dui, et accumsan sapien (highlighted). Vivamus vel augue tellus. Duis dignissim interdum blandit. Aliquam dignissim sapien eu ultrices ultricies. Mauris feugiat placerat turpis, laoreet blandit libero ullamcorper sed.</p>
<p>First line indent: 0.5” (no space above). Integer ultrices, nunc quis aliquet tempor, nibh dolor tincidunt leo, eu facilisis quam erat ac nulla. Integer sed mi eu orci mattis fermentum. Integer non nulla dolor. Fusce justo dui, porttitor quis maximus in, pharetra non ex. Vestibulum ante ipsum primis in faucibus orci luctus et ultrices posuere cubilia Curae; Nulla facilisi.</p>
<ul>
<li><p>Bulleted list item 1 <strong>(no space above)</strong></p></li>
<li><p>Bulleted list item 2</p></li>
<li><p>Bulleted list item 3 <strong>(no space below)</strong></p></li>
</ul>
<h3 id=\"heading-3\">Heading 3</h3>
<p>Praesent at magna interdum, cursus elit eget, viverra nisl. Nunc eu diam sit amet eros tempor pulvinar. Nam eget laoreet lorem, ac auctor enim.<a href=\"#fn2\" class=\"footnoteRef\" id=\"fnref2\"><sup>2</sup></a> Note number is Endnote Reference style.</p>
<ol style=\"list-style-type: decimal\">
<li><p>Number list item 1 <strong>(manual space above)</strong></p></li>
<li><p>Number list item 2</p></li>
<li><p>Number list item 3 <strong>(manual space below)</strong></p></li>
</ol>
<p>Nulla porttitor, enim nec varius lobortis, urna erat pharetra ex, vitae auctor nunc lectus ac velit. Curabitur nulla tellus, sollicitudin vitae consequat eu, pharetra in diam.</p>
<blockquote>
<p>Block text. Quisque et hendrerit sapien, vel tristique nunc. Aenean tempus fringilla leo sit amet dignissim. Fusce vulputate tortor a tellus ultricies aliquet. Curabitur tellus risus, dignissim ut viverra et, pharetra sed neque. Etiam vulputate eleifend lectus, eu sodales nunc cursus sed. Proin elementum nisl vitae sem vehicula mollis. Vestibulum suscipit neque sit amet leo tristique interdum. Morbi odio magna, commodo id sollicitudin nec, vestibulum ut purus. Proin at bibendum felis.</p>
</blockquote>
<h4 id=\"heading-4\">Heading 4</h4>
<p>Donec sodales hendrerit sagittis. Curabitur consectetur gravida ligula, vitae maximus nunc scelerisque ac. Ut vel porta enim. Mauris ultrices luctus sem, a auctor lorem.</p>
<blockquote>
<p><strong>Manually indented paragraph (no space above)</strong>. Proin sit amet ligula quam. Aenean rhoncus arcu eget massa fringilla, nec porttitor eros commodo. Maecenas metus metus, luctus et finibus auctor, tempor quis nisl. Nunc elementum tortor non nisl condimentum tristique. Nullam id dolor sed neque tincidunt facilisis quis nec erat. Praesent rutrum pharetra ex vel cursus. Duis commodo enim accumsan ultricies rutrum. Aenean consequat orci nec fermentum pharetra. Pellentesque pretium libero augue, in cursus nisi elementum vel. Vivamus pretium vel libero egestas mattis. Suspendisse potenti <strong>(no space below)</strong>.</p>
</blockquote>
<p><strong>Spaced at 1.15</strong>. Duis finibus est nisi, sit amet scelerisque lectus pellentesque eget. Curabitur neque ligula, malesuada aliquam nunc vitae, aliquam aliquet ipsum. Nulla vehicula interdum lorem, viverra fringilla urna varius vitae. Sed laoreet mi ut enim molestie auctor. Integer at risus et sem molestie volutpat molestie sed eros. Duis lectus velit, suscipit eget ullamcorper quis, sodales eget massa. Nunc ut auctor mauris, in finibus urna. Aliquam convallis turpis accumsan, pulvinar velit vel, porttitor augue. Ut rutrum quam sed euismod laoreet. Maecenas eget lorem arcu. Vivamus feugiat, sem efficitur hendrerit maximus, nisi nulla sollicitudin ligula, eu maximus nisi eros eget neque. Pellentesque tortor nisl, mollis nec diam pellentesque, dignissim faucibus ipsum. In hac habitasse platea dictumst.</p>
<p><strong>Spaced at 1.5</strong>. Etiam semper eu ex et sollicitudin. Sed eget mi ut est bibendum iaculis sit amet eu purus. Nam egestas porta scelerisque. Donec eleifend justo ut lectus dignissim, nec porta tellus pellentesque. Pellentesque feugiat sollicitudin facilisis. Quisque rhoncus quam neque, in facilisis dolor porta nec. Nam sed lorem nisl. Phasellus non magna ipsum. Duis sit amet facilisis arcu, vitae rutrum dolor. Nulla lorem ipsum, aliquet vitae nibh id, gravida laoreet libero.</p>
<p><strong>Spaced at 2.0</strong>. Morbi ut malesuada arcu, eu ullamcorper nunc. Cras posuere lectus vitae suscipit lacinia. Curabitur a venenatis erat. Nulla eu ornare augue. Suspendisse a erat libero. In scelerisque, lorem eget vestibulum finibus, sapien velit imperdiet nibh, in hendrerit ligula risus eget diam. Nunc posuere diam id molestie tincidunt. Vivamus diam nunc, rutrum egestas euismod quis, finibus id neque. Etiam eget porttitor elit. Fusce vel neque metus. Cras hendrerit varius feugiat. Cras vel vulputate dolor, vel porttitor elit.</p>
<p><strong>Centered</strong>. Quisque sed justo bibendum, lobortis ex sed, feugiat eros. In vestibulum ornare nisl non feugiat. Donec vulputate augue ut lacinia viverra. Etiam leo dolor, mollis eget lectus molestie, elementum mollis erat. Nam tristique eros elit, interdum tristique nibh vehicula in. Vivamus tincidunt metus iaculis quam tempus, eu facilisis sapien congue. Mauris pulvinar justo ut maximus aliquam. Pellentesque sit amet nunc interdum, cursus arcu vel, imperdiet justo. Nunc dignissim bibendum eros id consectetur. Morbi felis ligula, tempor quis scelerisque non, mollis ac justo. Donec ante orci, ultricies at orci ac, pellentesque auctor tortor. Pellentesque porta, urna vitae aliquet vestibulum, mi augue ornare urna, blandit commodo tellus lorem in purus. Nullam maximus placerat ligula. Phasellus quis elit tellus. Morbi rutrum, dui quis finibus blandit, purus nibh varius mi, vel feugiat arcu tellus sed lacus.</p>
<p><strong>Justified</strong>. Ut feugiat ultrices sollicitudin. Fusce eget ultricies felis, in eleifend risus. Fusce ut est eleifend, consectetur turpis ac, vehicula ante. Morbi consequat velit eget semper pulvinar. Quisque pretium placerat nibh sit amet eleifend. Curabitur dictum fermentum ornare. Proin vel urna consequat, commodo arcu sed, auctor metus. Curabitur non turpis euismod, pellentesque sapien faucibus, fringilla urna. Donec ac mauris eu purus tincidunt faucibus vel vitae turpis. Aenean quam lacus, consectetur pretium massa in, lobortis posuere metus. Curabitur molestie eget purus ut varius.</p>
<p><strong>Courier New</strong>. Vivamus tincidunt metus iaculis quam tempus, eu facilisis sapien congue. Mauris pulvinar justo ut maximus aliquam. Pellentesque sit amet nunc interdum, cursus arcu vel, imperdiet justo. Nunc dignissim bibendum eros id consectetur. Morbi felis ligula, tempor quis scelerisque non, mollis ac justo. Donec ante orci, ultricies at orci ac, pellentesque auctor tortor. Pellentesque porta, urna vitae aliquet vestibulum, mi augue ornare urna, blandit commodo tellus lorem in purus.</p>
<p><strong>The Good Life (Poem title bold, no style)</strong></p>
<p>When some people talk about money <strong>(no indent)</strong></p>
<blockquote>
<p>They speak as if it were a mysterious lover <strong>(Left indent 0.25”)</strong></p>
<p>Who went out to buy milk and never <strong>(Left indent 0.25”)</strong></p>
<p>Came back, and it makes me nostalgic <strong>(Left indent 0.75”)</strong></p>
<p>For the years I lived on coffee and bread, <strong>(Left indent 0.75”)</strong></p>
<p>Hungry all the time, walking to work on payday <strong>(Left indent 1.13”)</strong></p>
<p>Like a woman journeying for water <strong>(Left indent 1.13”)</strong></p>
<p>From a village without a well, then living <strong>(Left indent 1.31”)</strong></p>
<p>One or two nights like everyone else <strong>(Left indent 1.31”)</strong></p>
</blockquote>
<p>On roast chicken and red wine. <strong>(no indent)</strong></p>
<p>—Tracy K. Smith <strong>(Right-aligned)</strong></p>
<h5 id=\"reference-list-heading-5\">Reference List (Heading 5)</h5>
<h6 id=\"hanging-indent-0.5-heading-6\">Hanging Indent 0.5” (Heading 6)</h6>
<p>Comaroff, Jean, and John Comaroff, eds. 1993. Modernity and Its Malcontents: Ritual and Power in Postcolonial Africa. Chicago: University of Chicago Press.</p>
<p>Comaroff, John, and Jean Comaroff. 1991–97. Of Revelation and Revolution. 2 vols. Chicago: University of Chicago Press.</p>
<h1 id=\"paragraph-styles-heading-1\">Paragraph Styles (Heading 1)</h1>
<p>Title</p>
<p>Subtitle</p>
<p>Author</p>
<p>Date</p>
<p>Abstract</p>
<h1 id=\"heading-1-1\">Heading 1</h1>
<h2 id=\"heading-2-1\">Heading 2</h2>
<h3 id=\"heading-3-1\">Heading 3</h3>
<h4 id=\"heading-4-1\">Heading 4</h4>
<h5 id=\"heading-5\">Heading 5</h5>
<h6 id=\"heading-6\">Heading 6</h6>
<p id=\"heading-7\">Heading 7</p>
<p id=\"heading-8\">Heading 8</p>
<p id=\"heading-9\">Heading 9</p>
<p>First Paragraph. Lorem ipsum dolor sit amet, consectetur adipiscing elit. Curabitur sit amet lectus odio. Nunc vulputate dignissim viverra. Nunc fringilla lorem et dolor sagittis, vel tincidunt lorem consectetur. Duis nisi metus, laoreet nec dui vel, semper sodales lectus. Sed feugiat tellus velit, at mattis purus dictum at. Phasellus sed elit vehicula, scelerisque nisl eu, fringilla ante. Etiam at elit dolor. Donec venenatis, sem quis porta pharetra, libero felis vehicula nisl, eget blandit massa nibh vel elit. Maecenas in est at sem aliquam ultrices.</p>
<p>Body Text. Mauris pulvinar neque nisi, sed maximus est pellentesque vitae. Pellentesque id ante mi. Praesent laoreet libero purus, convallis ultrices augue dictum interdum. Quisque laoreet, ligula nec porta aliquet, purus quam tempor risus, nec blandit libero ligula eu mi. Nulla sed diam vulputate, ultricies sapien non, convallis purus. Sed in finibus est, nec porta augue. Nam suscipit tortor ac nunc mollis ullamcorper. Suspendisse at sodales arcu. Etiam aliquet sem et nulla cursus rhoncus.</p>
<blockquote>
<p>Block Text. Sed id leo vulputate, rutrum libero vitae, condimentum sapien. Quisque malesuada nulla vitae quam blandit efficitur. Curabitur ultrices quam neque, vitae congue eros mollis in. Nam ac elementum eros, eu bibendum lacus. Maecenas lacinia urna eget elit commodo molestie. Curabitur commodo, lacus in cursus dictum, mauris lorem egestas eros, sed sollicitudin lorem libero ut quam. Proin pharetra mollis sapien sit amet scelerisque. Ut faucibus nisl nec lacinia aliquam. Etiam id turpis scelerisque, posuere neque vel, porttitor nulla. Sed condimentum non libero sit amet ornare. Proin convallis scelerisque tincidunt. Orci varius natoque penatibus et magnis dis parturient montes, nascetur ridiculus mus. Sed gravida vulputate dui, dignissim eleifend lacus lacinia a. Praesent efficitur tempor enim eu sodales. Aliquam posuere, nunc sed ornare maximus, mi elit auctor nisl, nec viverra dolor lacus ac ex.</p>
</blockquote>
<p>Image caption. Copyright Cast Iron Coding.</p>
<p>Table caption. Phasellus accumsan turpis metus.</p>
<table>
<thead>
<tr class=\"header\">
<th>Col. 1, Row 1 (Table Grid)</th>
<th>Col. 2, Row 1 (Table Grid)</th>
<th>Col. 3, Row 1 (Table Grid)</th>
</tr>
</thead>
<tbody>
<tr class=\"odd\">
<td>Col. 1, Row 2 (Table Grid)</td>
<td>Col. 2, Row 2 (Table Grid)</td>
<td>Col. 3, Row 2 (Table Grid)</td>
</tr>
<tr class=\"even\">
<td>Col. 1, Row 3 (Table Grid)</td>
<td>Col. 2, Row 3 (Table Grid)</td>
<td>Col. 3, Row 3 (Table Grid)</td>
</tr>
<tr class=\"odd\">
<td>Col. 1, Row 4 (Table Grid)</td>
<td>Col. 2, Row 4 (Table Grid)</td>
<td>Col. 3, Row 4 (Table Grid)</td>
</tr>
<tr class=\"even\">
<td>Col. 1, Row 5 (Table Grid)</td>
<td>Col. 2, Row 5 (Table Grid)</td>
<td>Col. 3, Row 5 (Table Grid)</td>
</tr>
</tbody>
</table>
<p>Footnote Text</p>
<div class=\"Definition\">
<p>Definition</p>
</div>
<h1 id=\"character-styles\">Character Styles</h1>
<p><em>Italics</em></p>
<p><strong>Bold</strong></p>
<p><em>Underline</em></p>
<p><em><strong>Bold/Italics</strong></em></p>
<p><strong><em>Bold/Underline</em></strong></p>
<p><em><em>Italic/Underline</em></em></p>
<p>Superscript<sup>3</sup></p>
<p>Subscript<sub>4</sub></p>
<p><del>Strikethrough</del></p>
<p>Body Text Char. Cannot Assign</p>
<p><code>Verbatim Char</code>. Etiam vestibulum neque nec ipsum feugiat auctor.</p>
<p><a href=\"https://manifoldapp.org\">Hyperlink</a></p>
<p>Footnote Reference.<sup>2</sup></p>
<p>Definition Term. Etiam vestibulum neque nec ipsum feugiat auctor.</p>
<div class=\"footnotes\">
<hr>
<ol>
<li id=\"fn1\"><p>This is a footnote (Style: Footnote Text).<a href=\"#fnref1\">↩</a></p></li>
<li id=\"fn2\"><p>This is an endnote (Style: Endnote Text).<a href=\"#fnref2\">↩</a></p></li>
</ol>
</div>
</body>
</html>
SMaxOwok commented 6 years ago

I can address some of these:

zdavis commented 6 years ago

@tsmyre :point_up: Some more info above on why the word ingestion behaved the way it did.

zdavis commented 6 years ago

Thanks, Max, this is helpful. Because the issue is with Pandoc and not with Manifold, I don't see us acting on these shortcomings at this time. Closing this.

tsmyre commented 6 years ago

@SMaxOwok Did you adjust the stylesheet to address the date and abstract being hidden? I'm working in Edge right now and those still aren't coming through.