Eastern Dan [dnj] corpus and text input analysis
Sections
Corpora in Languages
- A medium-sized corpus in Eastern Dan [dnj]
- The Eastern Dan corpus consists of issues of a local newspaper (˗Pamɛbhamɛ) and chapters of medical advice from While waiting for a medical doctor, translated into Eastern Dan. A detailed description of the corpus is available here.
- A parallel corpus (the New Testament book of James) in the following languages: Spanish [spa], French [fra], English [eng], Eastern Dan [dnj], Me'phaa [tcf], Mandarin [cmn], plus two Nigerian languages, [bkv] and [eza].
Writing System Descriptions
/Writing-System-Descriptions
The focus here is on the Eastern Dan writing system in general, but some description also exists for the other languages in the parallel corpus. In describing Eastern Dan's text input options, this corpus description also attempts a modest application of the principles set out by Martin Hosken for Writing System Descriptions. A specific analysis of the text input options for Eastern Dan is in the Text Input Systems and their Descriptions section.
Text Input Systems and their Descriptions
/Text-Input-Systems-and-Descriptions
This corpus includes information about several keyboard layouts.
Tools
A variety of tools were used in this project. I try to keep a list and a copy of them here.
Tools for Text
/Tools/Tools-for-Text
The text tools include:
- Measuring diacritic density and tone melodies
- Text character counting
- Converting or Transforming Text
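As a rough illustration of the first two tasks, the sketch below counts characters and computes a simple diacritic density in Python. It is not one of the tools kept in this project. The function names `char_counts` and `diacritic_density` are hypothetical, and the definition of density used here (Unicode combining marks divided by letters, after NFD decomposition) is an assumption made for the example. Spacing tone marks such as the leading "˗" in ˗Pamɛbhamɛ are not combining marks, so this sketch does not count them as diacritics.

```python
import unicodedata
from collections import Counter


def char_counts(text):
    """Count characters after NFD decomposition, ignoring whitespace.

    NFD splits precomposed letters (e.g. "é") into a base letter plus
    a combining mark, so each is counted separately.
    """
    decomposed = unicodedata.normalize("NFD", text)
    return Counter(ch for ch in decomposed if not ch.isspace())


def diacritic_density(text):
    """Return combining marks per letter (an assumed, illustrative definition)."""
    counts = char_counts(text)
    # Unicode general categories starting with "M" are marks (Mn, Mc, Me).
    marks = sum(n for ch, n in counts.items()
                if unicodedata.category(ch).startswith("M"))
    # Categories starting with "L" are letters (Lu, Ll, Lm, Lo, Lt).
    letters = sum(n for ch, n in counts.items()
                  if unicodedata.category(ch).startswith("L"))
    return marks / letters if letters else 0.0
```

Calling `char_counts` on a line of corpus text gives a per-character tally that can be summed across files, and `diacritic_density` gives one number per text that can be compared across the languages in the parallel corpus. Counting spacing tone marks as well would need an extra set of characters passed in by the caller.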
Tools for Keyboard Layout analysis
Tools for Descriptions