arizona-linguistics / colrc-v2

COLRC version 2.0
5 stars 2 forks source link

Roots data cleanup #277

Open amyfou opened 1 year ago

amyfou commented 1 year ago

The data in our roots table was extracted and normalized for storage in our database tables by a series of python scripts. It was complicated, so there are some errors. These include things like unmatched parentheses, extra periods, and other obvious mistakes - mostly in the 'grammar' column. It also includes initial upper-case letters in the Nicodemus column entries, these should be lower-cased.

kyuriousity commented 11 months ago

cleaned_pulic_roots.csv

Cleaned exported public roots

For the future work: