srophe / britishLibrary-data

GNU General Public License v3.0
0 stars 3 forks source link

Add. 12,156 | https://bl.syriac.uk/ms/31 PDF contains characters or images which cannot be displayed in XML #1645

Open SPuchkova opened 2 months ago

SPuchkova commented 2 months ago

On p. 466 of Wright, we see Syriac with round black dots that look, for me, like paragraph separators but in Oxygen and on the website those dots are rendered with 'wav' letter. Is it a correct rendering?

https://github.com/srophe/britishLibrary-data/blob/7ed9d834c099761265cdcb643468a8d01e06be53/data/tei/31.xml#L1184

davidamichelson commented 2 months ago

Thanks @SPuchkova, yes you are right, these should not be waw. There is another one on page 647 no. 17 where it seems to be a circle.

image

Please delete the ܘ and then leave this issue open, because we need to mark these as having characters that cannot be displayed. That is done by adding the message: PDF contains characters or images which cannot be displayed in XML to the subject of the issue