open-dsl-dict / ipa-dict-dsl

IPA Pronunciation Dictionaries in DSL format
MIT License

Adding new languages, living and historical #1

Open nicolasdevops opened 2 years ago

nicolasdevops commented 2 years ago

I have two questions.

  1. It seems odd that Italian, Russian and Hebrew dictionaries are missing here. Did you begin work on those, and do you have any existing material I could use to create them?
  2. I have made dictionaries of Ancient Greek and Ancient Latin with IPA, compiled from the Tufts Perseus datasets using currently proposed IPA rules. Do you think they might be relevant?

dohliam commented 2 years ago

@nicolasdevops This sounds like it might be a better issue for the main repo from which the data here is derived, but I can answer your questions here briefly.

  1. The dictionary data in this ipa-dict project has been provided by volunteers, either generated manually or based on information that is available and freely licensed. If you don't see a particular language dictionary here, it's because no one has stepped forward to create one. Having said that, if you check out this issue you will see that there has already been some progress on an Italian dictionary, so perhaps you could follow up there to see if there is anything useful. As I recall, the generated dictionary files mostly just needed to be reviewed for errors, so any help in that regard would be greatly appreciated.
  2. Yes, IPA dictionaries for any and all languages would be very welcome in the project. We might want to mark them somehow with the particular pronunciation they use, since e.g. Latin has a number of different (current and past / prescriptive and descriptive) pronunciation standards. I will have to check the ISO standard to see if there are codes that could accommodate this. We can worry about that later though. In the meantime, the best thing to do would be to create a pull request over at the ipa-dict project. Once that's done, the new dictionaries can be automatically converted to DSL format and included here as well.
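
For anyone curious what the conversion step roughly involves, here is a minimal sketch. It is not the project's actual conversion script. It assumes the source data has one `word<TAB>/ipa/` entry per line and that the target is a plain DSL text file with `#NAME`, `#INDEX_LANGUAGE` and `#CONTENTS_LANGUAGE` headers followed by headwords with tab-indented bodies. The function name `to_dsl`, the dictionary title and the sample entry are hypothetical and only there for illustration.

```python
def to_dsl(lines, name, index_lang, contents_lang):
    """Convert 'word<TAB>/ipa/' lines into DSL-style text (illustrative sketch)."""
    out = [
        f'#NAME "{name}"',
        f'#INDEX_LANGUAGE "{index_lang}"',
        f'#CONTENTS_LANGUAGE "{contents_lang}"',
        "",  # blank line between the headers and the first entry
    ]
    for line in lines:
        line = line.rstrip("\n")
        # Skip blank lines and lines that lack a tab separator.
        if not line.strip() or "\t" not in line:
            continue
        word, ipa = line.split("\t", 1)
        out.append(word)                # headword starts at column 0
        out.append("\t" + ipa.strip())  # entry body is indented with a tab
    return "\n".join(out) + "\n"


# Hypothetical sample entry; the title shows one way to note the pronunciation standard.
sample = ["rosa\t/ˈrɔ.sa/\n", "\n"]
dsl = to_dsl(sample, "Latin (Classical) IPA", "Latin", "Latin")
print(dsl)
```

Putting the pronunciation standard in the dictionary title is only one possible way to mark it. A proper language or variant code could replace it once one has been settled on.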