usfm-bible / tcdocs

Technical Committee Documents

Book Numbers - and NT numbering #3

Closed jonathanrobie closed 7 months ago

jonathanrobie commented 2 years ago

MARBLE and many other internal numbering systems use 40 to identify Matthew, e.g.

<MARBLELinks xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xmlns:xsd="http://www.w3.org/2001/XMLSchema">
  <MARBLELink Id="04000100100000">
    <ThematicLinks />
    <LexicalLinks />
    <TextualLinks />
    <ImageLinks>
      <ImageLink>tb042403039</ImageLink>
      <ImageLink>Genealogy_of_Jacob</ImageLink>
      <ImageLink>Genealogy-Jesus</ImageLink>
      <ImageLink>JesseTree</ImageLink>
      <ImageLink>AT Map00305</ImageLink>
    </ImageLinks>
    <MapLinks />
    <ArticleLinks />
    <SectionLinks>
      <SectionLink>GNSBUK001639</SectionLink>
    </SectionLinks>
  </MARBLELink>
  <MARBLELink Id="04000100100002">
    <ThematicLinks />
    <LexicalLinks>
      <LexicalLink>SDBG:βίβλος:000002:Communication</LexicalLink>
    </LexicalLinks>
    <TextualLinks />

But the USFM specification says the book number for Matthew is 041. What is the intended use of that number? Is anything broken here?

I am creating a lot of datasets that need to know whether to treat Matthew as 40 or 41 internally, so I'd like to know if this is an issue ...
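To make the question concrete, here is a minimal Python sketch of how a dataset loader might read the book number out of a MARBLELink Id like the ones above. The field layout (3-digit book, 3-digit chapter, 3-digit verse, 5-digit segment) is an assumption inferred from the sample Ids, not something taken from MARBLE documentation, and the names `MarbleRef` and `parse_marble_id` are hypothetical.

```python
from typing import NamedTuple


class MarbleRef(NamedTuple):
    """Hypothetical container for the parts of a MARBLELink Id."""
    book: int
    chapter: int
    verse: int
    segment: int


def parse_marble_id(link_id: str) -> MarbleRef:
    """Split a 14-digit Id into its parts.

    ASSUMPTION: the layout is BBBCCCVVVSSSSS (book, chapter, verse,
    segment). This is inferred from the sample Ids in this thread only.
    """
    if len(link_id) != 14 or not (link_id.isascii() and link_id.isdigit()):
        raise ValueError(f"expected 14 ASCII digits, got {link_id!r}")
    return MarbleRef(
        book=int(link_id[0:3]),
        chapter=int(link_id[3:6]),
        verse=int(link_id[6:9]),
        segment=int(link_id[9:14]),
    )


ref = parse_marble_id("04000100100000")
print(ref.book)  # the number MARBLE uses for Matthew
```

Under that assumed layout, the first sample Id carries book number 40, which is the value at issue here.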

joelthe1 commented 2 years ago

I asked a related question some time back, in case that is helpful: https://github.com/ubsicap/usfm/issues/99

jwickberg commented 2 years ago

I think the only place that MAT = 41 is used in Paratext is in the file name of the book text.

The versification code we use from the LibPalaso library has MAT = 40

I don't know of anywhere in USFM/USX where the book number is used.

jonathanrobie commented 2 years ago

Both LibPalaso and MARBLE use Matthew = 40. Most of the resources I have use Matthew=40 if they put NT books into the 40s.

mhosken commented 2 years ago

PTXprint follows Paratext in using 41. This is a potential mess. We need to agree and implement. I don't want to have to translate between them. If we pick 40, that is going to cause quite a bit of pain, even if 40 is a better answer.

I propose we use 41 until Paratext can make the change. I don't see that happening soon though. The cost of transition seems high to me.

Sigh. GB, Martin

jonathanrobie commented 2 years ago

Where does Paratext use 41?

jwickberg commented 2 years ago

I found 2 places looking in the Paratext code:

  • the name of the USFM file for a book
  • the name of the HTML file for a book when exporting a project as HTML

In both of these cases, all the numbers for NT books are shifted up 1.

jonathanrobie commented 2 years ago

So in the names of identifiers within a file, Matthew=40, in filenames, Matthew=041, and that's fairly consistent in Paratext?

Jonathan

jwickberg commented 2 years ago

Except for the references you found in Marble data, references in Biblical Terms data, and book numbers in progress data (maybe a couple of others), the default is to use book id rather than book number. When book number is used internally, it has MAT=40.

The \id line and values for parsed references in USX use the book id.

So, I think the only inconsistency is in file names where 41 was used for historical reasons.
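For anyone who has to move between the two conventions, the difference described in this thread can be sketched in a few lines of Python. This is an illustration, not Paratext or LibPalaso code: the function names are hypothetical, it assumes OT numbers 1 to 39 are identical in both schemes, and it covers only the 66 books discussed here (deuterocanonical and peripheral books are deliberately out of scope).

```python
# Internal number of Matthew, as reported for LibPalaso and MARBLE in this thread.
FIRST_NT_INTERNAL = 40
# ASSUMPTION: only the 66-book range is handled; anything else is rejected.
LAST_INTERNAL = 66


def file_number(internal: int) -> int:
    """Map an internal book number (MAT=40) to the file-name number (MAT=41)."""
    if not 1 <= internal <= LAST_INTERNAL:
        raise ValueError(f"internal book number out of range: {internal}")
    # NT books are shifted up by 1 in file names; OT books are unchanged.
    return internal + 1 if internal >= FIRST_NT_INTERNAL else internal


def internal_number(file_num: int) -> int:
    """Map a file-name number (MAT=41) back to the internal number (MAT=40)."""
    # 40 is the unused slot between the OT and NT in the file-name scheme.
    if file_num == FIRST_NT_INTERNAL or not 1 <= file_num <= LAST_INTERNAL + 1:
        raise ValueError(f"file-name book number out of range: {file_num}")
    return file_num - 1 if file_num > FIRST_NT_INTERNAL else file_num


print(file_number(40), internal_number(41))
```

Keying datasets on the book id (for example MAT) and converting to a number only at the file-name boundary avoids having to choose between 40 and 41 internally.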

mhosken commented 2 years ago

Thinking about this, I don't think it is in scope for USFM/USX. I think this is a Scripture Burrito question. I therefore propose closing this issue once someone has transferred it to that project. Could whoever does the transfer please respond here or close this issue?

jonathanrobie commented 2 years ago

I don't think it SHOULD be in scope for USFM/USX, but it is, because our spec currently seems to define it here:

https://ubsicap.github.io/usfm/identification/books.html

In that table, does the NUMBER column have a normative meaning? If so, what is that meaning? The only definition given is this:

Some scripture editors like Paratext http://paratext.org/ may also include the number shown for each book in the filename for that book.

But it doesn't say, for instance, that Paratext uses different numbers internally, and doesn't even mention the numbering scheme used by Paratext and MARBLE. I think we're slightly pregnant here, we need to get to a consistent state.

mhosken commented 2 years ago

I propose we don't carry that numbering forward, declare it non-normative and turf it over the wall to Scripture Burrito :) Those numbers never occur in USFM/USX data (he says, oh so confidently) so can only be informative if listed in the current docs.

KentSpiel commented 2 years ago

From the Biblica perspective, we use the numbers one through 66 and renumber all of our files beginning with Matthew to start with 40. Since the point of our current mission is to document the current state of USFM/USX, I propose that we include the current numbering system. At the same time, I propose we also approve any numbering system that includes a 2-digit number and a 3-letter book code.

jonathanrobie commented 2 years ago

As far as I know, most projects use the Paratext numbering system, largely because most USFM is created by Paratext and the numbering is created automatically. I propose that we document that as the most common numbering, deprecate the current documentation of NT numbering, but not forbid other numbering.

jonathanrobie commented 1 year ago

Joel and I were asked to talk to the Scripture Burrito working group about book numbers and other metadata that spans individual USFM files.

We need to define these things in Scripture Burrito, and have already defined ways to declare book order, including multiple orders, using ordered lists (https://docs.burrito.bible/en/latest/schema_docs/scope.html). This means that the same content can be published in multiple orders, e.g. in a Catholic edition, an Orthodox edition, and a Protestant edition. We do not use book numbers for this. We do not believe they are necessary. Different agencies use 40 or 41 for Matthew, but this does not cause any real interoperability problems. After all, they also spell Matthew differently, depending on the language. 40-Matthew or 41-Matthias are both valid.

Scripture Burrito is an exchange format, not a persistent file format, though there is no technical barrier for an application that wants to use it that way. We are happy to have applications use Scripture Burrito to specify metadata for a set of files anywhere they need to.
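The idea of declaring order with lists rather than numbers can be sketched roughly as follows. This is plain Python, not the Scripture Burrito schema: the two lists are short, abbreviated excerpts chosen for illustration, and the names `ORDER_A`, `ORDER_B`, and `position` are hypothetical.

```python
# ASSUMPTION: abbreviated excerpts of two publication orders, keyed by
# 3-letter book id. Real orders would list every book in the edition.
ORDER_A = ["EZR", "NEH", "EST"]
ORDER_B = ["EZR", "NEH", "TOB", "JDT", "EST"]


def position(order: list[str], book_id: str) -> int:
    """Return the 1-based position of a book within one declared order."""
    return order.index(book_id) + 1


# The same book lands at a different position in each order, so no
# fixed book number has to be stored with the content itself.
print(position(ORDER_A, "EST"), position(ORDER_B, "EST"))
```

Because the position is derived from whichever list is in use, a consumer that needs a number for sorting or file naming can compute one per edition instead of relying on a shared numbering.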

mhosken commented 7 months ago

I think we've resolved this