undertheseanlp / underthesea

Underthesea - Vietnamese NLP Toolkit
http://undertheseanlp.com
GNU General Public License v3.0
1.37k stars 271 forks source link

List of Vietnamese syllables with IPA #623

Open rain1024 opened 1 year ago

rain1024 commented 1 year ago

Plan

  1. Gather resources:

    • [x] Collect Vietnamese dictionaries, linguistic studies, and other resources that contain information on Vietnamese syllables.
  2. Analysis of the resources:

    • [ ] Analyze the resources collected in step 1 to extract information on the syllables.
    • [ ] Extract the syllables and note any variations or irregularities.
  3. Syllable inventory:

    • [ ] Based on the information extracted in step 2, create an inventory of Vietnamese syllables.
    • [ ] Include the basic set of syllables and any variations or irregularities noted.
  4. Verification:

    • [ ] Verify the accuracy of the syllable inventory by having native speakers of Vietnamese listen to recordings of the syllables.
    • [ ] Make any necessary corrections based on feedback received.
  5. Organize and store the data:

    • [ ] Organize the syllable inventory in a format that is easy to access and use.
    • [ ] Store the data in a format such as CSV or JSON.
    • [ ] Consider using a database management system such as MongoDB to store and manage the data.
  6. Share and publish the data:

    • [ ] Share the syllable inventory with the research community and make it publicly available.
    • [ ] Publish the data on a website or submit it to a linguistic data repository.
  7. Maintaining and updating the list:

    • [ ] Regularly update the syllable inventory with new syllables, correct errors, and update the list as needed.