news-catalyst / google-app-scripts

a collection of scripts for tinynews google docs add-ons
https://script.google.com/a/newscatalyst.org/d/1ILURq69o3cYUy6k1n1X6HwxdMfl9xWNhILYuZxgLfeblb3IR15WCMZSj/edit
1 stars 1 forks source link

Fixes for Word formatting problems on headings and main images #370

Closed jacqui closed 2 years ago

jacqui commented 2 years ago

Test in the script editor with version 106 - there's a test case for the example doc at this version already.

Using this example BBG article I was able to fix two problems related to microsoft word formatting in the doc:

  1. the main image now comes through in the published article
  2. the strange 2 line "HEADING_2" is now parsed as two separate <h2> tags

I'm sure there are going to be more Word-related formatting issues, so what I'd like to do is create some content in my own copy of Word and import it to Google docs, then run through each article and fix whatever comes up. It would also be helpful if there are more example Word docs I could use - having them already imported into Google Docs would be fine.

I've added parsing additional images (any images in the doc) with strange word formatting to my list already.