MaryIsbell / TheIllustratedArcticNews

A digital edition of The Illustrated Arctic News

How to encode Ampersands versus plus signs #5

Closed by noorka 7 years ago

noorka commented 8 years ago

Should we encode + as & or just write a +? Both & and + appear in place of the word "and". At the moment I have encoded every symbol form of "and" as &.

MaryIsbell commented 8 years ago

I think the best solution is to encode plus signs as abbreviations for "and", but I only want to do this if we can be sure that TAPAS will transform it that way. So, this requires a little testing. If there is already an XML file on TAPAS that uses this encoding, we should check whether the diplomatic/normalized views toggle between the two encoded readings. If there isn't one on TAPAS yet, we could start testing now to see if that will work. Better to do that testing before encoding the rest of the issue.

The simpler solution would just be to say that all plus signs are actually ampersands, but that strikes me as inaccurate, and less forgivable in this instance because page images won't be available.
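To make the proposal concrete, here is a minimal sketch of what "plus sign as an abbreviation for and" could look like using the TEI `<choice>`, `<abbr>`, and `<expan>` elements, with a small Python function that imitates a diplomatic/normalized toggle locally. The sample sentence and the `render` helper are invented for illustration, and this says nothing about how TAPAS itself handles the markup, which is exactly what still needs testing.

```python
import xml.etree.ElementTree as ET

TEI_NS = "http://www.tei-c.org/ns/1.0"

# Hypothetical sample line; the wording is invented for illustration.
SAMPLE = (
    '<p xmlns="http://www.tei-c.org/ns/1.0">Officers '
    "<choice><abbr>+</abbr><expan>and</expan></choice> men, "
    "ice &amp; snow</p>"
)

def render(elem, view):
    """Return the text of elem, keeping <abbr> for the diplomatic
    view and <expan> for the normalized view."""
    skip = f"{{{TEI_NS}}}" + ("expan" if view == "diplomatic" else "abbr")
    parts = [elem.text or ""]
    for child in elem:
        if child.tag != skip:
            parts.append(render(child, view))
        # Text following the child belongs to the parent in both views.
        parts.append(child.tail or "")
    return "".join(parts)

p = ET.fromstring(SAMPLE)
diplomatic = render(p, "diplomatic")
normalized = render(p, "normalized")
print(diplomatic)  # Officers + men, ice & snow
print(normalized)  # Officers and men, ice & snow
```

Encoded this way, the plus sign survives in the diplomatic reading while the ampersand stays an ampersand in both, so the two symbols are never conflated.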

noorka commented 8 years ago

With this in mind, I am leaving plus signs as + in the transcription and encoding any & as &amp;, just to make it easier in the future when we decide how we will actually solve this issue.
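As a rough illustration of this interim approach, the sketch below escapes a bare ampersand so the XML stays well-formed and records where literal plus signs remain, so they are easy to find once a decision is made. The sample line is invented, and the choice of character offsets as the review list is only an assumption about what would be convenient.

```python
import re
from xml.sax.saxutils import escape

# Hypothetical transcription line, invented for illustration.
raw = "Officers + men, ice & snow"

# escape() turns a bare & into &amp; and leaves the plus sign untouched.
encoded = escape(raw)

# Character offsets of every literal plus sign, kept for later review.
plus_positions = [m.start() for m in re.finditer(r"\+", encoded)]

print(encoded)         # Officers + men, ice &amp; snow
print(plus_positions)  # [9]
```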