christos-c / bible-corpus

A multilingual parallel corpus created from translations of the Bible.
Creative Commons Zero v1.0 Universal

Empty verses in English Web Bible #20

Closed by morethanbooks 8 months ago

morethanbooks commented 8 months ago

In this Bible, there are 7 verses without text:

christos-c commented 8 months ago

These are actually empty in the WEB text (e.g. ACTS 15). I'm not sure whether it's better to remove the verse <seg>s or leave them blank.
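For anyone who wants to check a translation for blank verses, here is a minimal sketch in Python. It assumes the verses are stored as `<seg type="verse">` elements with an `id` attribute; the sample document, its ids, and the helper name `find_empty_verses` are hypothetical and not taken from the repository.

```python
import xml.etree.ElementTree as ET


def find_empty_verses(xml_text):
    """Return the ids of verse <seg> elements that contain no text.

    Assumes verses are <seg type="verse" id="..."> elements (an assumption
    about the corpus layout, not confirmed in this thread).
    """
    root = ET.fromstring(xml_text)
    empty_ids = []
    for seg in root.iter("seg"):
        if seg.get("type") != "verse":
            continue
        # Join all nested text so whitespace-only verses also count as empty.
        text = "".join(seg.itertext()).strip()
        if not text:
            empty_ids.append(seg.get("id"))
    return empty_ids


# Hypothetical sample: the ids v1..v4 are placeholders, not real verse ids.
SAMPLE = """<cesDoc>
  <text>
    <body>
      <seg id="v1" type="verse">Some verse text.</seg>
      <seg id="v2" type="verse"></seg>
      <seg id="v3" type="verse">   </seg>
      <seg id="v4" type="verse">More verse text.</seg>
    </body>
  </text>
</cesDoc>"""

empty = find_empty_verses(SAMPLE)
print(empty)
```

Listing the ids instead of deleting the elements keeps the verse numbering aligned across translations, which fits the decision to leave the segments blank.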

morethanbooks commented 8 months ago

Thanks for checking. I will leave them empty, as they are right now. Perhaps you could state this somewhere in the header, to document explicitly that you are aware of it.

christos-c commented 8 months ago

Thanks for the suggestion José. Is there a guide for how to do this? Should I use the sourceDesc tag?

morethanbooks commented 8 months ago

Yes, I guess that would be a good place. However, I am not completely sure.

christos-c commented 8 months ago

After going through the CES documentation, it looks like the easiest place would be the projectDesc tag, since it's unstructured and meant to convey general information. I'll add the note in that tag for this and the other translations.
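A rough sketch of how such a note could be added programmatically, assuming each file has a `projectDesc` element somewhere in its header. The sample header, the note wording, and the helper name `add_project_note` are hypothetical; the actual files may be laid out differently.

```python
import xml.etree.ElementTree as ET


def add_project_note(xml_text, note):
    """Append a free-text note to the first <projectDesc> element.

    Assumes the header contains a <projectDesc> element holding plain text
    (an assumption about the files, not confirmed in this thread).
    """
    root = ET.fromstring(xml_text)
    project_desc = root.find(".//projectDesc")
    if project_desc is None:
        raise ValueError("no projectDesc element found")
    existing = (project_desc.text or "").strip()
    # Keep whatever description is already there and add the note after it.
    project_desc.text = f"{existing} {note}".strip()
    return ET.tostring(root, encoding="unicode")


# Hypothetical header fragment and note text, for illustration only.
HEADER = """<cesDoc>
  <cesHeader>
    <encodingDesc>
      <projectDesc>Existing project description.</projectDesc>
    </encodingDesc>
  </cesHeader>
</cesDoc>"""

NOTE = "Some verses are empty in the source text and are kept as empty segments."

updated = add_project_note(HEADER, NOTE)
print(updated)
```

Appending to the existing text, instead of replacing it, preserves any description already in the tag.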