HughP / dnj-corpus

A small corpus of a local newspaper

Tone Pattern Stats #1

Closed by nqthqn 6 years ago

nqthqn commented 6 years ago

This does not yet include configuration for which tones (H+, H, M, L, L+) correspond to which initial/final word-stem isolating characters (", ', `, =, -, etc.).

It also leaves the last column, Kinds of Other, blank.

| I | F | Lexical String | Usage | % | Kinds of Other |
|---|---|---|---|---|---|
| = | - | klang | 60 | 0.42 | ? |
| = | - | | 39 | 0.28 | ? |

It could instead be:

| I | F | Lexical String | Usage | % | Kinds of Other |
|---|---|---|---|---|---|
| = | - | klang | 60 | 0.42 | HL:10 LM:4 H+L:1 |
| = | - | | 39 | 0.28 | HL+:3 |

Once we add the configuration for these, we can modify the script to search for matching stems in other tone patterns.

By the way, there's currently a lot of O(n^2) work going on in the script.
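A minimal sketch of how the marker configuration and the "Kinds of Other" lookup might fit together. Everything here is an assumption for illustration, not the repository's actual script: the marker-to-tone mapping in `TONE_BY_MARKER` is invented (the real correspondence is not given in this thread), words are assumed to look like `=klang-` (one initial marker, the stem, one final marker), and the function names are hypothetical. Indexing the words by stem in a dictionary means each word is visited once, rather than comparing every word against every other word.

```python
from collections import Counter, defaultdict

# HYPOTHETICAL mapping from isolating characters to tones.
# The real correspondence still has to be configured.
TONE_BY_MARKER = {
    '"': "H+",
    "'": "H",
    "=": "M",
    "-": "L",
    "`": "L+",
}


def split_word(word):
    """Split a marked word such as '=klang-' into (initial, stem, final).

    Assumes exactly one marker character on each side of the stem.
    """
    return word[0], word[1:-1], word[-1]


def tone_pattern(initial, final):
    """Translate a pair of markers into a tone pattern label such as 'HL'."""
    return TONE_BY_MARKER[initial] + TONE_BY_MARKER[final]


def kinds_of_other(words):
    """For each (initial, final, stem), count the stem's other tone patterns."""
    # Single pass over the corpus: stem -> Counter of (initial, final) pairs.
    by_stem = defaultdict(Counter)
    for word in words:
        initial, stem, final = split_word(word)
        by_stem[stem][(initial, final)] += 1

    result = {}
    for stem, pairs in by_stem.items():
        # This inner loop is bounded by the number of marker pairs
        # (at most 25 with five markers), not by the corpus size.
        for current in pairs:
            others = {
                tone_pattern(i, f): n
                for (i, f), n in pairs.items()
                if (i, f) != current
            }
            result[(current[0], current[1], stem)] = others
    return result


def format_others(others):
    """Render counts as 'HL:10 LM:4', most frequent pattern first."""
    ranked = sorted(others.items(), key=lambda item: -item[1])
    return " ".join(f"{pattern}:{count}" for pattern, count in ranked)
```

For example, with the made-up word list `["=klang-"] * 3 + ["'klang-"] * 2 + ["-klang="]`, the entry for `("=", "-", "klang")` formats as `HL:2 LM:1` under the hypothetical mapping above.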

HughP commented 6 years ago

I think you did a push and merge without first doing a pull. How do I look at your contributions to the README file?

HughP commented 6 years ago

Never mind, I fixed it. I also fixed the tables that were broken.

nqthqn commented 6 years ago

Cool, yeah. I never touched README.md. You can't push and then merge: merging happens on the client side. Though now on GitHub you can merge branches in the online interface, which is pretty badass :)