openscriptures / morphhb

Open Scriptures Hebrew Bible
https://hb.openscriptures.org
Other
178 stars 63 forks source link

What is the best way to create a Verse-per-Line version of the Open Hebrew Bible? #75

Open Nainoia-Inc-Admin opened 3 years ago

Nainoia-Inc-Admin commented 3 years ago

I would like to creat a Verse-per-Line copy of the Open Hebrew Bible. What is the best way to do that? Which files should I use from this repository?

pierre-amadio commented 3 years ago

Hello.

You will find the text in xml format for each book in the wlc directory.

python with BeautifulSoup is probably the easiest way to display 1 verse by line for all book.

On Thu, 4 Feb 2021 at 02:37, Jeff Martin notifications@github.com wrote:

I would like to creat a Verse-per-Line copy of the Open Hebrew Bible. What is the best way to do that? Which files should I use from this repository?

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/openscriptures/morphhb/issues/75, or unsubscribe https://github.com/notifications/unsubscribe-auth/AJJS36DQPQFGPHXFYNYPYKDS5H27HANCNFSM4XB5EWZA .

DavidIB commented 3 years ago

Do you mean the one by OpenSriptures? This is essentially the same as the Westminster transcription of the Leningrad text. YOu can get that data from the OpenScriptures site - https://github.com/openscriptures/morphhb

The TOTHT https://github.com/tyndale/STEPBible-Data text in our repository is virtually the same as this text, though with a few corrections. Extracting the verse-per-line Hebrew was not a difficult task for me, so created the Hebrew text from TOTHT. You can pick it up at https://www.dropbox.com/s/bj30roa2h4uar1o/TOTHT%20Hebrew-only%20text.txt?dl=0 I hope it is useful for you.

Every blessing

David IB

On Thu, Feb 4, 2021 at 2:08 AM Jeff Martin notifications@github.com wrote:

I would like to creat a Verse-per-Line copy of the Open Hebrew Bible. What is the best way to do that? Which files should I use from this repository?

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub https://github.com/openscriptures/morphhb/issues/75, or unsubscribe https://github.com/notifications/unsubscribe-auth/AAM5BOQLAWZAX7CJAPXYP3TS5H27HANCNFSM4XB5EWZA .

Nainoia-Inc-Admin commented 3 years ago

Thanks! Questions...

To convert the TOTHT to VPL is it simply a matter of concatenating the Hebrew words together with a space in between for each verse? Also is the TOTHT updated when this OpenScriptures project is updated?

Now in the OpenScriptures/morphhb project I see three folders that seem to contain the Hebrew text: 1) /MAPM/ 2) /wlc and 3) /oxlos-import.

The MAPM/MAPM.xml file already seems to be close to VPL format. I just need to know what to do with the -SEG- tags. I need the Hebrew text in VPL without any tags. How do I strip out the -SEG- tags?

The /wlc/bible books file format has even more tags. I would need help to know how to strip out all the tags to get to VPL format.

And finally the /oxlos-import format is just a long list of all the Hebrew words with verse and word number. This file format would be the easiest to use to make a VPL format. Do I just concatenate the words together with a space in between? And which file in the /oxlos-import is best to use?

And of the three options above can each be used to produce the same best Open Scriptures Hebrew bible or is one folder the better to use?

DavidTroidl commented 3 years ago

/wlc is the latest OSHB text. /MAPM is a completely different project. /oxlos-import has no scripture files in it.

Nainoia-Inc-Admin commented 3 years ago

It seems to me that the VPL format could easily be constructed with /oxlos-import/wlc.txt. Would it work to simply concatenate all the Hebrew words in that file together per verse separated by a space?

DavidTroidl commented 3 years ago

Correction on my previous post. /oxlos-import is 11 years old. I would not consider it reliable. A lot has been updated since then.

Nainoia-Inc-Admin commented 3 years ago

Okay thanks that is what I needed to know.

I plan to use this as my source for the Open Scriptures Hebrew Bible, https://ebible.org/details.php?id=hboWLC I think that is a well groomed source.