If the above library works well, we can automatically generate Persian transliterations which could benefit the Golistan project and DLME generally. This would involve adding a new task to our airflow workflow. The task would not apply to all DAGs, only those with Persian script metadata. Since we do the language tagging in traject, we would need to implement this after the transform task.
https://pypi.org/project/PersianG2p/
If the above library works well, we can automatically generate Persian transliterations which could benefit the Golistan project and DLME generally. This would involve adding a new task to our airflow workflow. The task would not apply to all DAGs, only those with Persian script metadata. Since we do the language tagging in traject, we would need to implement this after the
transform
task.