giellalt / bugzilla-dummy

0 stars 0 forks source link

Verses not singled out in fin, sme, smj and swe NT (Bugzilla Bug 321) #62

Closed albbas closed 17 years ago

albbas commented 18 years ago

This issue was created automatically with bugzilla2github

Bugzilla Bug 321

Date: 2006-06-29T09:08:17+02:00 From: Trond Trosterud <> To: Saara Huhmarniemi <>

Last updated: 2006-11-15T11:13:54+01:00

albbas commented 18 years ago

Comment 1049

Date: 2006-06-29 09:08:17 +0200 From: Trond Trosterud <>

These bible versions all are (or swe: can potentially be) formatted as having distinct titles, but verses mingled with the text. What we want is to use the verse formatting as a parallel text aligner clue. The verses must thus be singled out. Cf. also bug #198, and a forthcoming news thread.

Now, the swe bible is dumped at corp/orig/swe/bible/ot/sv.tar, but the tar file consists of both ot and nt files. Until now, we have accidently stored our nt files as one long file, but our ot files as one file per bible book. If this should go on (which it can), the files in sv.tar should be rearranged.

albbas commented 17 years ago

Comment 1190

Date: 2006-11-14 12:09:06 +0100 From: Saara Huhmarniemi <>

The verses and chapter numbers are now singled out from the nt:s for sme, smj and swe. The format could still be refined, the documents are in corp/orig/lang/bible/nt/ -directories with suffix bible.xml. There is also a script bible2xml.pl that converts the documents to a format, where the chapter numbers and section titles are marked as

-elements and verse numbers are not shown at all. That enables treating nt:s as sources of running text. The converted text is stored under bound-hierarchy.

I will continue to unify the structure of available old testament texts.

albbas commented 17 years ago

Comment 1193

Date: 2006-11-15 11:13:54 +0100 From: Saara Huhmarniemi <>

There is now a convention to have the new testament as one big file, whereas the old testament in several files. That is because for nno, nob, and swe, we have only 1.Mos and Psalms.

All the available bible files, except Finnish ot are now converted to the bible format. There are couple of things that should be noted: