Open dlee opened 3 years ago
Thanks for taking the time to point these out. This dataset is due for a complete revamp. In the future I plan to link to the individual tagged words in the ESV, i.e. I'll avoid copyright issues by not including the untagged words in between. The dataset is also being updated to a tagging scheme that includes all Hebrew prefixes and suffixes. This means that, in the short term, I won't be fixing these issues. Sorry!
Thank you for the update. Do you have an estimated timeline for the updated dataset?
ASAP
David IB
There are some lines in TTESV that do not conform to the specified format. I couldn't really figure out how to fix the errors, but they generally seem to fall along the lines of the word index having a +00 followed by a long list of Strong's numbers. Some examples:
There's also this line that has a word index of 601:
There's also a line that has an invalid Strong's number (0100419):
I think the last entry is supposed to be 19=<01004+04801>
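
In case it helps with finding the rest of these, here is a rough Python sketch of the kind of check that could flag such entries. The entry shape is an assumption inferred only from the corrected example above (a one- or two-digit word index, `=`, then one or more five-digit Strong's numbers joined by `+` inside angle brackets); it is not taken from the TTESV format specification and may need adjusting. The name `find_bad_entries` and the sample entries are made up for illustration and are not lines from the dataset.

```python
import re

# ASSUMED entry shape, inferred from the single example 19=<01004+04801>:
# 1-2 digit word index, "=", then 5-digit Strong's numbers joined by "+"
# inside angle brackets. The real TTESV specification may differ.
ENTRY_RE = re.compile(r"(\d{1,2})=<(\d{5}(?:\+\d{5})*)>")


def find_bad_entries(entries):
    """Return the entries that do not match the assumed shape."""
    return [entry for entry in entries if not ENTRY_RE.fullmatch(entry)]


# Hypothetical entries built from the values mentioned in this issue.
sample = [
    "19=<01004+04801>",   # well-formed under the assumption
    "601=<01004>",        # word index too long
    "19=<0100419>",       # Strong's number is 7 digits, not 5
    "+00=<01004+04801>",  # word index is not a plain number
]

for entry in find_bad_entries(sample):
    print("suspicious entry:", entry)
```

Running something like this over each entry in the file would only list candidates to inspect by hand; it does not attempt to repair them.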