Closed StephanAkkerman closed 2 weeks ago
Translation is necessary for the semantic sim. Transliteration is necessary for the orthographic sim.
If the languages for translation are too little we can use this instead: https://pypi.org/project/deep-translator/#check-supported-languages Plan:
Use our own branch for the most support for languages: https://github.com/StephanAkkerman/py-googletrans/tree/master
Description:
Problem: Currently the supported languages are only written down in a markdown file. We should have this somewhere in the code as a check for later.
Solution: When #42 is done, check which languages can be tranlsated + transliterated. Update the markdown file to show clear what is possible for each language: G2p model leading + translation & transliteration For googletrans package we can simply import the supported language dictionary, for g2p we need to manually write it ourselves probably
Prerequisites: [List any requirements or dependencies needed before starting.]
Tasks:
Additional context Add any other context or screenshots about the feature request here.