lb42 / KJV_1611

A TEI-Conformant version of the 1611 text of the Bible
7 stars 0 forks source link

Chapter labels in Psalms #4

Open DavidHaslam opened 6 years ago

DavidHaslam commented 6 years ago

I might be mistaken, but based on my knowledge of the Blayney 1769 edition, the chapter labels in Psalms are probably in the form PSAL. I. rather than CHAP. I..

cf. Yours are like:

<div xmlns="http://www.tei-c.org/ns/1.0" xml:id="CPsalms_01" type="chapter">
<pb facs="http://www.kingjamesbibleonline.org/1611-Bible-KJV/Psalms-Chapter-1-3.jpg"/>
<head>CHAP. I.</head>
<head>1 The happinesse of the godly. 4 The
vnhappinesse of the vngodly.</head>
lb42 commented 6 years ago

The text you see ("chap. 1" etc) is the text I found in the HTML source I was converting. I agree with you that in the original these labels read "Psalme 1" etc. -- as can be seen by looking at the page image cited in the link http://www.kingjamesbibleonline.org/1611-Bible-KJV/Psalms-Chapter-1-3.jpg above.

So fixing this would be yet another thing worth doing, but requiring human intervention... Feel free to fork!

DavidHaslam commented 6 years ago

Not quite ready to fork. And I probably wouldn't find your scripts as ready to use on my Windows 7 platform.

I am assuming that you're on Linux or similar, based on the EOL styles in the XML files, and your use of commands such as curl.

Even so, if I discover a improvement task that matches my skills and software tools, that's certainly something for me to leave open.

DavidHaslam commented 6 years ago

It may be sensible to insert an attribute in the head elements to distinguish between these two types:

It's evident that 16 chapters lack such a description! These are located as follows:

I wonder why?

DavidHaslam commented 6 years ago

FIO. There are 2331 fw elements in total acting as section headings. On average, that's roughly 1.71 per chapter.

lb42 commented 6 years ago

There are actually three different contexts in which <head> is used : I added a heading for each book; the HTML provides a "chap" heading; and (with the exceptions you note) this is usually followed in the source by a kind of summary of the chapter content. It would be entirely appropriate to add a @type attribute to distinguish these three cases, and probably not hard to do automatically. The things I have marked as <fw> elements are typographically distinct in the HTML: they usually correspond with a page title, i.e. something added to the forme (hence fw, for forme work) for a particular page. I havent checked to see if they ever appear elsewhere.

lb42 commented 6 years ago

Ecclesiasticus 51 lacks a summary/description in the HTML, though not in the source ("A prayer of Jesus sonne of Sirach"): that's a bug that should be fixed. Prayer of Manasseh has a summary which in the HTML source is included as part of the first verse, but enclosed in square brackets: probably that should be fixed too. As for Proverbs 11-24, there's (sort of) an answer to your question at the start of Proverbs 10 "From this Chapter to the fiue and twentieth, are sundry obseruations of morall vertues, and their contrary vices."

lb42 commented 6 years ago

Fixed some of these at c62d87f ; the rest will be fixed when the fixup script runs.