benwbrum / fromthepage

FromThePage is a wiki-like application for crowdsourcing transcription of handwritten documents.
http://fromthepage.com
GNU Affero General Public License v3.0
170 stars 50 forks source link

table markup with sub/sup text tags loses rows #1679

Open saracarl opened 4 years ago

saracarl commented 4 years ago

A customer ran into a problem with the following table markup:

MACHINERY REPAIRS........................349 | SURPLUS................................. 4<sub>Stockholders Liquidation</sub> -------- 3 |
MARQUEE..................................32 | <sup>TAX Withheld-----------------------107</sup>TAXES...................................104 |
MARQUEE REPAIRS..........................57 | TELEPHONE...............................270 |

I stripped out the subject mark-up, the sub and sup tags and the plain table saved and displayed fine:

MACHINERY REPAIRS........................349 | SURPLUS................................. 4Stockholders Liquidation -------- 3 |
MARQUEE..................................32 | TAX Withheld-----------------------107TAXES...................................104 |
MARQUEE REPAIRS..........................57 | TELEPHONE...............................270 |

I then added in the subject link around Robert Simon. That text saved and displayed just fine.

When I added in the first sub, though, the table text dropped to just 2 two lines (the ones starting with MACHINERY and MARQUEE REPAIRS).

Here's the really interesting bit: Even if I replaced the transcription text with the original plain transcript, with no subject or sub/sup tags, it did not display the page correctly.

The version should have been overwritten completely, but it wasn't, somehow.

(I'm noticing that GitHub isn't showing a preview for the table with the markup, even in triple-quotes. Maybe this is a bug in markdown?)

saracarl commented 4 years ago

My reproduce: https://fromthepage.com/saracarl/iiif-showcase-2018/letter-from-emma-davis-to-john-c-brewer-february-6-1876-7f00702f-5c0f-47a6-9fca-931a758d6c21/display/315492

saracarl commented 4 years ago

We need to try to restore a plain text (no sub/sup tags) version of this page: https://fromthepage.com/carnegiehallarchives/music-hall-company-of-new-york-accounting-ledgers/volume-6/display/950374