lingdb / Sound-Comparisons

Exploring phonetic diversity across language families —
http://www.soundcomparisons.com
Other
13 stars 8 forks source link

Where do I find the actual data on phonetic transcriptions? #461

Closed LinguList closed 7 years ago

LinguList commented 7 years ago

In order to help by giving recommendations on best practice of phonetic transcriptions (I see they are currently just taken unchecked from the transcribers, but do you at least normalize to one unicode version?), I need to be pointed to the raw data. Is this uploaded in some project? Do you plan on uploading it? And how do you get it into the SoundComparisons? This remains a mystery for me, but it is crucial if you want to increase the synergies with our work on standardization.

LauraWae commented 7 years ago

Hi Mattis,

In general, we use SQLs to upload transcriptions to the website. I will send you such an example per E-Mail, as Github does not allow Excel-Sheets. Paul creates those sheets. Transcribers enter their data to it.

I do not know if you have access to the Owncloud Folder System of Soundcomparisons. Most (!) of the raw transcriptions can be found there. Please tell me if you need further indications.

LinguList commented 7 years ago

Thanks, this is indeed really helpful. I'd suggest to share these things online, or even better, to have them converted to regular csv-files (with the info inside) and then track them on github. This is also good for versionizing. If those excel things are converted to the database representation anyway, it is a no-brainer for a programmer in doing the same conversion to the cldf formats, we are currently developing.