glottobank / tukano

Repository for computer-guided reconstruction with Jena wordlist standard for Tukano language data
GNU General Public License v2.0
1 stars 0 forks source link

tukano

Repository for computer-assisted reconstruction with Jena wordlist standard for Tukano language data.

News

I have now managed to conduct a first alignment analysis along with tokenization and the like for Tukano data. The results can soon be browsed via this weblink:

The tool which displays the results allows for online editing, provided the user has the password. Otherwise, editing is possible, but nothing will be stored in the database.

Steps I undertook are the following:

Cleaning and IPA conversion

Tokenization / phonological segmentation

The cleaned data wos segmentized phonologically using LingPy's basic functions. The results seem more or less OK, but few questions need to be solved. I have placed them under this link:

Alignment analysis

The current alignment analysis is by no means complete, but just an illustration of what is possible. In the near future, it is planned to include an alignment editor in the app, so it will then be possible to manually correct alignments.

What are the next steps?

The next steps consist of data checking, and spurious error correction:

The next concrete step will be to segmentize the proto-forms (otherwise, we can't do any nice analysis for comparison of reflexes and proto-forms). After this step, we make an initial analysis checking for potential borrowings, just to make sure enough reflexes are represented in all branches, and no really spurious cases might blurr our inferences.