teusbenschop / ndebele

The text of the Ndebele Bible for use by the translation team
3 stars 2 forks source link

A counted word list to assist with proof reading the text #43

Open DavidHaslam opened 6 years ago

DavidHaslam commented 6 years ago

The attached Zip file contains a tab delimited counted word list for the text of the Ndebele translation. The analysis was done by means of a bespoke TextPipe filter.

merged.words.count.rev.txt.zip

In removing punctuation, special provision was made to preserve the hyphenated words. The final count duplicate lines filter also sorted the words and is case-sensitive.

This may assist with proof reading the text. Browse the file to look for any mis-spelled words.

The third tab field has the words reversed. When opened with Excel the data can be sorted on column C to get the words in column B in rhyming order. i.e. Words with similar endings are found together. This technique can sometimes be fruitful in finding further anomalies in spellings.

The data analysed is from verse, canonical title and paragraph text, but excludes section headings. Cross-references, footnotes and all USFM tags were first removed.

The analysis can be readily repeated in the future, should the need arise after further corrections.

DavidHaslam commented 6 years ago

Today's merges did not change the word list.