lexibank / abvd

CLDF dataset derived from Greenhill et al.'s "Austronesian Basic Vocabulary Database" from 2020.
https://abvd.eva.mpg.de
Creative Commons Attribution 4.0 International

How to handle inconsistent cognateset IDs #1

Closed xrotwang closed 6 years ago

xrotwang commented 6 years ago

The word for "Twenty" in language Palembang Malay is assigned to cognate set 3.6. This may mean 36, or 3, 6. It is certainly a good example of why we shouldn't slugify identifiers, but how should we correct it? Fix it in the source?

SimonGreenhill commented 6 years ago

I've fixed this in the source (it is now "3, 6"). In terms of lexibank, I think we should be liberal with warnings and with raising errors. Perhaps the way to go here is to raise an error ('unable to parse cognate set "xxx" in language y item z').
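
A minimal sketch of what such a check could look like, assuming cognate set IDs are plain integers separated by commas. The function name `parse_cognate_sets` and the pattern `VALID_ID` are hypothetical and not part of lexibank; the error message follows the wording suggested above.

```python
import re

# Assumption: a valid cognate set ID is a plain run of digits.
VALID_ID = re.compile(r"\d+")


def parse_cognate_sets(raw, language, item):
    """Split a raw cognacy string such as "3, 6" into a list of ID strings.

    Raises ValueError instead of guessing when a part is not a plain
    integer, e.g. "3.6", which could mean 36 or 3 and 6.
    """
    ids = [part.strip() for part in raw.split(",")]
    for part in ids:
        if not VALID_ID.fullmatch(part):
            raise ValueError(
                'unable to parse cognate set "{}" in language {} item {}'.format(
                    raw, language, item
                )
            )
    return ids
```

With this sketch, `parse_cognate_sets("3, 6", "Palembang Malay", "Twenty")` yields two IDs, while the original value `"3.6"` raises an error rather than being silently turned into a single identifier.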