DB Design: meta-info about texts

We need to know more about the texts we have and the texts we need. This involves a few sides:

Knowing information about the size and structure of text in general. E.g., knowing that Bereishit contains 50 chapters and that chapter 50 of Genesis contains 26 verses.
Knowing summary information about the actual texts and translations we have in DB. E.g, being able to say, we have 100% of the the Hebrew of Bereishit across 3 versions, but that we only have 35% of Mishna Peah in English. This information may be summarized across all the texts we have.
Knowing information about a particular version of a text. E.g., verse for verse knowing whether and by whom a text has been reviewed, or storing ratings for the quality of a particular translations on a segment by segment basis.

Collecting information in (1) maybe be just as difficult as actually getting the text (e.g., counting precisely how many Rashis there are on which dafs of gemara). Handling incomplete information will be a requirement. Being able to provide estimates for sizes will be very helpful for estimating the magnitude of our task.

Sefaria / Sefaria-Project

DB Design: meta-info about texts #34