cldf-clts / clts-legacy

Cross-Linguistic Transcription Systems
Apache License 2.0
4 stars 3 forks source link

calculate overlap between the transcription data #74

Closed LinguList closed 6 years ago

LinguList commented 6 years ago

This will be useful for statistics, but also to see how much overlap we actually find.

LinguList commented 6 years ago

Interesting results (see statistics.md) and here (shared sounds per dataset):

TD1 TD2 sounds percent
phoible eurasian 619 0.28
phoible pbase 528 0.29
phoible lapsyd 540 0.33
eurasian pbase 453 0.29
eurasian lapsyd 417 0.29
pbase lapsyd 359 0.34